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Abstract 

We consider the problem of designing mechanisms that interact with strategic agents through 
strategic intermediaries (or mediators), and investigate the cost to society due to the mediators’ 
strategic behavior. Selfish agents with private information are each associated with exactly 
one strategic mediator, and can interact with the mechanism exclusively through that media¬ 
tor. Each mediator aims to optimize the combined utility of his agents, while the mechanism 
aims to optimize the combined utility of all agents. We focus on the problem of facility loca¬ 
tion on a metric induced by a publicly known tree. With non-strategic mediators, there is a 
dominant strategy mechanism that is optimal. We show that when both agents and mediators 
act strategically, there is no dominant strategy mechanism that achieves any approximation. 
We, thus, slightly relax the incentive constraints, and define the notion of a two-sided incentive 
compatible mechanism. We show that the 3-conrpetitive deterministic mechanism suggested by 
Procaccia and Tennenholtz [12] and Dekel et al. [3] for lines extends naturally to trees, and is 
still 3-conrpetitive as well as two-sided incentive compatible. This is essentially the best pos¬ 
sible OEl. We then show that by allowing randomization one can construct a 2-conrpetitive 
randomized mechanism that is two-sided incentive compatible, and this is also essentially tight. 
This result also closes a gap left in the work of Procaccia and Tennenholtz M and Lu et al. [SJ 
for the simpler problem of designing strategy-proof mechanisms for weighted agents with no 
mediators on a line, while extending to the more general model of trees. We also investigate a 
further generalization of the above setting where there are multiple levels of mediators. 
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1 Introduction 


The Algorithmic Mechanism Design literature is generally interested in the implications of strategic 
behavior on the quality of social decision making. The usual assumption is that agents interact 
directly with a mechanism that picks an outcome. Yet, in many complex real world settings the 
interaction goes through intermediaries. If these intermediaries are acting strategically, this can 
influence the outcome picked by the mechanism, and result with an increase in social cost. 

Consider, for example, a political decision taken by indirect voting. There are districts, and 
each district is represented by a representative. Each citizen has a position, and let us assume 
the positions of the citizens are points on an interval. A decision is also a point on the interval, 
and the cost for a citizen of such a point equals to the distance of her position from the decision 
made. Each representative aims to minimize the total cost for his own constituency, while the 
global goal is to minimize the total cost of all citizens. Decisions are taken using the reports of the 
representatives exclusively (there is no direct interaction with the agents), and these representatives 
have the freedom to manipulate their reports if such a manipulation helps their constituency. We 
are interested in questions such as: What is the cost for society of such strategic behavior? How 
should the society set up the decision process to minimize that cost? 

More generally, we are interested in designing mechanisms that interact with strategic agents 
through strategic intermediaries (which we also call mediators). Agents have private information, 
and when put in a game, each agent acts to optimize her own utility! The mechanism designer 
aims to optimize a social goal. The intermediaries do not have any private information of their 
own, rather, each intermediary acts in the mechanism on behalf of the agents associated with him, 
aiming to optimize the same social goal with respect to his agents only (note that he does not have 
a personal agenda and is completely benevolent). As the intermediaries control the information 
flow from the agents to the mechanism, the mechanism faces strategic behavior not only of the 
agents, but also of intermediaries: within the freedom given by the mechanism, an intermediary 
acts strategically to optimize on behalf of the agents he represents! In this paper we aim to 
understand the implications of the strategic behavior of intermediaries on the welfare of the agents. 

The general framework outlined above can be studied in the context of many specific settings, 
and might yield very different results in different cases. Here, we focus on one such example and 
leave the consideration of other settings for future works. The setting we consider is facility location 
on a metric induced by a publicly known tree, which generalizes the decision making problem on a 
line introduced above. There are n agents, each located at some private location. The agents are 
partitioned to k disjoint sets, and each set is represented by a unique mediator. The mechanism 
(or center) should locate one facility. The cost of an agent is her distance from the location of the 
facility, and she aims to minimize her cost! The social goal considered is the goal of minimizing 
the total distance of the agents from the facility. 

If the center had access to the locations of all agents he could minimize the total cost by locating 
the facility at a median of all locations. While all our results hold for general tree metrics, for the 
sake of the exposition, in the introduction we mainly discuss the euclidian metric on an interval of 
the real line. For that metric, if f* is the i-th left most agent (breaking ties arbitrarily) and n is 

1 Throughout the paper we refer to an agent as “she”, and to a mediator as “he”. 

2 We assume that an intermediary is able to manipulate the reports of his agents, and do not consider settings in 
which there exists an infrastructure for sending messages between the agents and the center through the mediators 
in a non-manipulable way (e.g., using cryptographic means.) 

3 While the general framework does not preclude transfer of utilities, in this specific model there is no money 
and utilities cannot be transferred. Thus, our results for facility location can be viewed as part of the literature on 
approximate mechanism design without money m- 
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odd, then there is a unique optimal location at the median location tr n+ iw 2 (for even n there is an 
interval of optimal locations, between the two medians). With strategic agents but no mediators 
(or equivalently, with non-strategic mediators), there is a dominant strategy mechanism that is 
optimal: locate the facility at a median point, breaking ties to the left. While this result gives 
a complete picture for the standard model without strategic mediators, we show that with both 
strategic agents and strategic mediators the picture is much more complicated. We first show that 
there does not exist a dominant strategy mechanism achieving any approximation. This happens 
even in a simple setting with two possible locations, a single mediator and a single agent, as if the 
agent switches between the locations in her report, the mediator should switch them back, and vice 
versa. 

Given the impossibility to achieve a dominant strategy mechanism with good performance, we 
suggest a slightly weaker solution concept for direct revelation mechanisms (in which each agent 
reports her location, and each mediator reports the locations of all his agents). Our aim would 
be to build mechanisms which achieve good approximation (minimize the ratio between the cost 
of the outcome and the optimal cost). A mechanism is agent-side incentive compatible (agent- 
side 1C) if each agent has a dominant strategy to be truthful given that her mediator is truthful 
(regardless of any parameter of the model, like the number of mediators, and regardless of other 
players’ strategies). A mechanism is mediator-side incentive compatible (mediator-side IC) if each 
mediator has a dominant strategy to be truthful given that all his agents are truthful (again, 
regardless of any parameter and regardless of other players’ strategies). We aim to construct 
mechanisms that are two-sided incentive compatible (two-sided IC), i.e., they are both agent-side 
incentive compatible and mediator-side incentive compatible. We construct both deterministic and 
randomized mechanisms, and prove that they achieve essentially the best possible performance 0 

One of the settings considered by Procaccia and Tennenholtz [12] is equivalent to designing 
deterministic mediator-side IC mechanisms on an interval of the real line. Their work implies that 
the results of Dekel et al. [3] for regression learning induce a 3-competitive deterministic mediator- 
side IC mechanism on an interval, and that this is essentially the best possible competitive ratio 
for such a mechanism. The mechanism induced works as follows: for every mediator, it replaces 
all points reported by the mediator by the optimal! location for that mediator, and then finds 
an optimal location with this new input (the mechanism essentially computes median of medians, 
weighted by the number of agents each mediator represents). 

We prove the above mechanism is also agent-side IC, and describe a simple extension of it to 
general trees. This yields the following theorem. 


Theorem 1.1. There exists a deterministic two-sided IC mechanism on tree metrics with a com¬ 
petitive ratio of 3. Moreover, for any fixed e > 0, there is no deterministic two-sided IC mechanism 
with a competitive ratio of 3 — e. 

Procaccia and Tennenholtz m raised the question whether it is possible to get a better compet¬ 
itive ratio using randomization. They were able to answer affirmatively in the case of two mediators 
representing a “similar” number of agents. Lu et al. [3] extend the analysis of the mechanism of 
[12] to the case of multiple mediators representing a “similar” number of agents. However, even if 
all mediators have equal number of agents, the competitive ratio of this mechanism approaches 3 

4 A11 the mechanisms we construct run in polynomial time, while our lower bounds hold independent of compu¬ 
tational consideration. Like in prior literature in approximate mechanism design without money [12], the barrier to 
optimality is incentives, not computation. 

5 There might be multiple optimal locations, in such cases ties need to be handled carefully to preserve incentives. 
To simplify the exposition, in the introduction we assume there are no ties. 
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as the number of mediators increase. On the negative side, |8] gives a hardness result of 1.33 using 
a complex LP-based proof. 

We suggest a new and sophisticated randomized mechanism that is 2-competitive and works 
for any tree. We also prove using a simple argument that this is essentially the best possiblell 

Theorem 1.2. There exists a randomized two-sided IC mechanism on tree metrics with a competi¬ 
tive ratio of 2. Moreover, for any fixed e > 0, there is no randomized two-sided IC mechanism with 
a competitive ratio of 2 — e. 

This result closes the gap left in the work of Procaccia and Tennenholtz [12] and Lu et al. [8] for 
the simpler problem of designing strategy-proof mechanisms for weighted agents with no mediators 
on a line, while extending to the more general model of trees. 

For the case of locations on an interval of the real line the mechanism works as follows. For 
every mediator it replaces all points reported to the mediator by the optimal location for that 
mediator. For simplicity assume that the number of agents can be divided by 4. Then, it sorts 
the locations and uses a uniformly selected point among the n/2 central points (that is, the points 
from the n/ 4 + 1 leftmost location to the 3n/4 leftmost location). 

The randomized mechanism for trees generalizes this idea but is much more involved, and is our 
main technical contribution. This mechanism chooses from the set of medians (optimal locations of 
mediators) a “core” subset. This core is the equivalent of the central points from the line case. Each 
point in the core is assigned some positive probability to become the facility location. However, 
unlike in the line case, the probabilities assigned to the points of the core are non-uniform, and 
are carefully chosen to achieve both the competitive ratio and the right incentives. The exact 
probability distribution depends on the medians of all mediators, including medians outside of the 
core. If all the reports happen to fall on a single line, then the probability distribution becomes 
uniform, and the algorithm reduces to the one described above for lines. 

We remark that all our mechanisms run algorithms that use only the optimal location for each 
mediator, and do not need, in addition, access to the exact locations of the agents associated with 
each mediator. We call an algorithm that satisfies this property a mediator based algorithm. We 
prove that such algorithms, which use only the locations of the optimal points of the mediators (and 
not the locations of their agents), cannot be better than 2-competitive. Interestingly, we show that 
there exists a deterministic mediator based algorithm that has a competitive ratio of 2, yet that 
algorithm is not two-sided IC. Thus, for deterministic two-sided IC mechanisms, the implications 
of strategic behavior by the mediators goes beyond the constraint of being mediator based; such 
mechanisms cannot be better than 3-competitive (which is tight). Thus, there is a gap that is a 
result of incentives, and is not due to insufficient information. 

Tree metrics are a strict generalization of line metrics and capture domains that cannot be rea¬ 
sonably modeled by line metrics. Consider the following toy example. People of three nationalities 
live in a single country ( e.g ., Switzerland), and want to elect a president. The candidates for the 
position differ in two attributes: their nationality and their degree of nationalism (for example, 
how much are they willing to settle for a compromise when dealing with an issue on which the 
different national groups disagree). Each citizen, naturally, wants to elect a president sharing his 
nationality, but different citizens of the same national group might want to elect candidates with 
different degrees of nationalism. Notice that a candidate of low nationalism is more acceptable by 
citizens of other nationalities (regardless of the level of nationalism, every citizen would probably 
like to have a president of her own nationality), thus, the metric induced by this example is a star 

6 Like the hardness of ]8], our hardness holds, in fact, even for mediator-side IC mechanisms. 
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with 3 edges (of course, one can think of a country with more nationalities to get a star with more 
edges). 

We also consider a generalization of the above setting allowing multiple levels of mediation. 
In other words, the center, agents and mediators form a tree, in which the root is the center and 
the leaves are the agents. Every internal node of the tree is a mediator representing its children 
in the tree. Unfortunately, the competitive ratio of every mechanism for this setting degrades 
exponentially with the height of the tree, even when the mechanism is only required to respect a 
very weak definition of incentive compatibility. This result is consistent with the existence of a 
symmetric voting system composed of k levels where a minority of size exponentially small in k can 
control the decisions of the system[j] Finally, we show that the mechanism that iteratively applies 
weighted median has a competitive ratio which is essentially optimal and satisfies the weak notion 
of incentive compatibility. 

1.1 Related Work 

In this paper we deal with mediators who act as intermediaries between a set of agents and a 
mechanism. The most related setting studied in the literature is the recent work on auctions 
with intermediaries [3]. There, as in our setting, both agents and intermediaries are strategic. 
However, the setting there is Bayesian while ours is Pre-Bayesian. Also, our aim is to address the 
social welfare issue requiring dominant strategies by the agents when their associated mediator is 
truthful, rather than revenue maximization. 

More generally, our work refers to the study of mediators (see, e.g., [9]J for a study in the context 
of complete information games, and [2] for a study in the context of incomplete information games). 
However, the typical signature of work on mediators is a single mediator that serves as an arbitration 
device: the agents are not a captive audience, and each of them can decide to participate in the 
game directly or work through the mediator. In our setting there are multiple intermediaries, each 
having his own captive audience, which must play the game through the intermediary. Moreover, 
the intermediaries are players and try to optimize their own utilities. Our setting nicely fit with 
situations such as voting by the (already selected) representatives of a geographic area or interest 
group. Additionally, we would like to mention the work of Leyton-Brown et al. [6] which deals 
with game theoretic aspects of bidding clubs in which “collusion devices” (cartels) are strategically 
created in a fixed mechanism (first price auction). In contrast, in our setting the partition of agents 
to mediators is pre-determined and our focus is on mechanism design given that fact. 

The specific example of the framework that we consider is related to the recent literature on 
approximated mechanism design without money m- This literature deals with approximation 
algorithms which are used to resolve incentive issues for tractable problems rather than overcome 
computational complexity of intractable problems, when no money transfers are available. An ad¬ 
ditional conceptual contribution of our work is extending the literature on approximate mechanism 
design without money to incorporate mediation devices. Indeed, the problem studied in this paper, 
the facility location problem, is the canonical problem of that literature, which is easily solved 
(optimally) if no intermediaries are in place. 

As pointed out in the previous section, the design of mediator-side incentive compatible mecha¬ 
nisms is equivalent to the design of strategy-proof mechanisms for weighted agents that was studied 
by Procaccia and Tennenholtz m, and later also by Lu et al. 0 (these papers only considered the 
special case of a line metric). The implications of this equivalence to our settings were discussed 
above. 

7 See http://gowers.wordpress.com/2013/10/15/holding-a-country-to-ransom/ for an example of such a vot¬ 
ing system. 
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The literature on facility location on a line based on information provided by strategic agents 
is classic in the context of mechanism design with single-peaked preferences m- The extension 
of this problem to facility location on a network has been introduced by m- It has been shown 
that there exist non-dictatorial strategy proof algorithms for facility location on trees, and that 
any graph possessing circles does not allow for that. The study of approximate mechanism design 
without money for networks [l] discusses the minimization of the maximal distance to a facility on 
a network using deterministic and probabilistic strategy proof algorithms, yielding some positive 
approximation results and tight bounds. The problem of approximating the optimal location of 
two facilities on a line using strategy proof mechanisms has been discussed in [7j, while the general 
case of locating k facilities in an approximately optimal manner using strategy proof mechanisms 
can be handled for large populations by the general technique given in nu. 

2 Model and Solution Concept 

Within the general framework of strategic mediators we focus on one specific mechanism design 
problem: facility location on a metric induced by a publicly known tree T = ( V , E ) with the 
following metric on each edge. Each edge e £ E in the tree is mapped to the interval [0,£ e ] for 
some l e > 0, with the usual Euclidian metric. In our problem there are n agents, each of which 
has a private position which can be represented by a point on the tree. The position of an agent 
can be either a node v G V or a point somewhere along an edge e G E. Each one of the n agents 
is associated with one of k mediators. For i G [k], mediator di represents a set Ai of n % agents; we 
denote these agents by ct*.i, 0 ^ 2 ,..., ai yUi . As we assume each agent is associated with exactly one 
mediator, the sets of agents of any two mediators do not intersect and ^i=i n * = n • The position 
of each agent is only known to the agent herself, and we denote the private position of agent a l 3 
by tij. Everything else is common knowledge. In particular, the number of agents represented by 
each mediator is known to the mechanism 0 We call a point from the metric induced by T simply 
a “point”. For example, by saying that “p is a point” we mean that p is a point from the metric 
induced by T. Particularly, the location of each agent is a point. 

The center has to pick a position for a single facility. If the center locates the facility at point p, 
then the cost of an agent a t ,j is dist(p, Uj), where dist(p, tij) is the distance between p and tij along 
the metric induced by T. The social cost of locating the facility at point p is Ylie[k] jelm] dist(p, Uj), 
i.e., the sum of all the agents’ costs. The objective of the center is to pick a location for the facility 
that minimizes the social cost. The cost of a mediator di {i € [k]) is the total cost for the agents 
he represents, which is Ylje[ n } dist(p, Uj). Each mediator aims to minimize his cost. We use the 
term player to denote either an agent or a mediator. We assume that the center and players are 
risk neutral, and for a distribution over locations, they evaluate their cost by the expected cost. 
Note that in our model there is no money, and utilities cannot be transferred. 

An algorithm for the center is a mapping from its input, the locations of all agents, to a location 
for the facility. We say that an algorithm is a-competitive, or has a competitive ratio of a, if for 
any set of locations for the agents, the location picked by the center for the facility induces a cost 
that is at most a times larger than the minimal possible cost (with respect to its input). 

When the agents’ locations are private information the center has to come up with a mechanism 
by which players report their information and this information is used to pick a facility location. 

8 If the number of agents represented by each mediator is considered private information, then the mechanism has 
no way to distinguish “important” mediators representing many agents from “unimportant” mediators representing 
only few agents. This intuitive impossibility can be easily formalized to show that no constant competitive ratio is 
possible. The assumption that size of the population represented by each mediator is public is reasonable in many 
settings, for example, the size of the population of a congressman’s district is publicly known. 
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We consider direct revelation mechanisms in which each agent is asked to report her location 
(to her mediator), and each mediator is asked to report the location of each of his agents. The 
mechanism uses the public information and the mediators’ reports to locate the facility, with the 
aim of minimizing the social cost. We say that a mechanism is a-competitive, or has a competitive 
ratio of a, if under the solution concept that we consider, the location of the facility picked by the 
center has cost that is at most a times larger than the minimal possible cost. Crucial to our model 
is the assumption that the center (or the mechanism) can not interact directly with the agents, 
and has access to their locations only through their mediators, which can manipulate the agents’ 
reports. 

Solution Concept. Any direct revelation mechanism picked by the center puts the agents and 
the mediators (the players), which are both strategic, into a game. We would like to use mechanisms 
which induce games with some desired properties. 

A direct revelation mechanism is dominant strategy truthful if it is a dominant strategy for each 
agent to report her location truthfully (regardless of the strategies chosen by the mediators and the 
other agents), and it is a dominant strategy for each mediator to report the locations of all of his 
agents to the center exactly as reported to him by the agents (again, regardless of the strategies 
chosen by the agents and the other mediators). We observe that asking a competitive mechanism 
to be dominant strategy truthful is unrealistic. 

Observation 2.1. No direct revelation dominant strategy truthful mechanism has a finite compet¬ 
itiveness, and this is true even if the center is allowed to charge the mediators and agents. 

Proof. Consider an instance with a single mediator representing a single agent which can take two 
possible locations x and y. To have finite competitiveness the center must locate the facility at the 
location of the agent (when both the agent and the mediator are truthful). However, the center 
gets no information other than the report of the mediator, and therefore, it must always locate the 
facility at the location reported by the mediator. Moreover, the charges collected by the center can 
depend only on this location. 

Let p x and p y be the charges that the mediator pays when reporting x and y, respectively. 
Assume without loss of generality that p y > p x . Now, assume that the agent’s strategy is to report 
y despite the fact that she is located at x. If the mediator switches the location back, then his 
cost is p x , while a truthful repetition of the agent’s report will result in a cost of p y + |y — x\ > 
Px + |V — x\ > p x . Thus, it is clearly non-optimal for the mediator in this case to truthfully repeat 
the report of the agent. □ 

Remark 1. The above impossibility applies to a setting in which all entities have exactly the 
same utility function, so there are no conflicts. It is a result of the sequential nature of information 
propagation from the agents to the center through the mediators, and the incompatibility of that 
with dominant strategies. 

Given this impossibility result we need to settle for a slightly weaker solution concept, achieving 
Incentive Compatibility (IC) in the following sense. We still want each agent to have an incentive to 
be truthful, as long as her mediator is truthful (as opposed to playing an “unreasonable” strategy), 
and we want each mediator to be truthful as long as his agents are truthful. This is captured by 
the following definition. 

Definition 2.1. A direct revelation mechanism is agent-side incentive compatible if for every 
mediator di , in the induced game created by fixing di to be truthful, truthful reporting is a dominant 
strategy for each agent a h] represented by di. 
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A direct revelation mechanism is mediator-side incentive compatible if for every mediator di, 
in the induced game created by fixing all di’s agents to be truthful, truthful reporting is a dominant 
strategy for di. 

A direct revelation mechanism is two-sided incentive compatible if it is agent-side incentive 
compatible and mediator-side incentive compatible. 

Note that in any two-sided incentive compatible mechanism, it is in particular an Ex-post Nash 
for all players to be truthful. 

To understand the implication of strategic behavior by the mediators we compare the compet¬ 
itiveness achieved by the best two-sided incentive compatible mechanisms to the competitiveness 
achieved by the best agent-side incentive compatible mechanisms (we do so both for deterministic 
and for randomized mechanisms). 

2.1 Agent-Side Incentive Compatible Mechanisms 

Median points play a significant role both in the optimal algorithm and our mechanisms. We 
next present some basic definitions and observe that median points exactly characterize optimal 
locations. 

Definition 2.2. Following are the definitions of median points and weighted median points. 

• A weighted point is a pair (p, x) where p is a point and x is a positive real number. Given 
a weighted point p = ( p,x ), we say that x is the weight of p, and write w(p) = x. We also 
think of p as located at location p in the metric. Hence, we can talk, e.g., about the distance 
between two weighted points^ 

• Given a multi-set S of elements, let f(p,S) be the multiplicity of p in S (i.e., f(p,S) is 
the number of copies of p in S). Given an additional multi-set S', we denote by S U S' and 
S\S' two multi-set containing f(p,S) + f(p, S') and max{/(p, S) — f(p, S'), 0} copies of every 
element p £ S U S', respectively. 

• Given a multi-set S of weighted points and a point p. 

— Let S p denote the multi-set of weighted points in S that have p as their location. More 
formally, for every weighted point q = (q, x) € S, the multi-set S p contains f(q, S) copies 
of q if q = p, and no copy of q otherwise. 

— The weight of S, denoted by w(S), is the total weight of the weighted points in S. More 
formally, w{S) = Y,qeS w (v) ' 

— Let m p be the maximum weight of a multi-set S' C S \ S p such that the path connecting 
every two weighted points of S' does not go through p. 

— We say that p is a weighted median of S if m p < w(S)/2. 

• Given a multi-set S of points, we say that a point p is a median of S if it is a weighted median 
of the multi-set S' containing f(p, S ) copies of (p, 1) for every point p G S. 

Informally, a point p is a median of S if removing it splits the tree T into parts, each containing at 
most \S\/2 points of S. The importance of median points stems from the following easy observation, 
whose proof (with simpler versions going back to [5]) is deferred to Appendix lAl 

Observation 2.2. For every non empty finite multi-set S of weighted points, there is at least one 
weighted median. Moreover, a point p is a weighted median of S if and only if locating the facility 
at p minimizes the weighted total cost of a set of agents located at the points of S (i.e., the sum 
Y,q & s w (?) • /(?> S ) ■ dist (P> q))- 

9 In the interest of readability, throughout the paper we put the tilde sign above letters representing weighted 
points. 
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The mechanism that always picks a median of all the locations of the agents (with a careful tie 
breaking) is optimal and agent-side incentive compatible. Thus, we have the following observation 
that naturally extends a well known result for line metrics [12] . We present its proof for completeness 
in Appendix [A] 

Observation 2.3. There exists a deterministic agent-side incentive compatible mechanism that is 
optimal (i.e., I-competitive). 

We note that the optimal agent-side IC mechanism is deterministic. Hence, randomization 
clearly does not help in improving performance when mediators are not strategic. Our results show 
that this is not the case when mediators are strategic and one aims for two-sided IC mechanisms. 

2.2 Mediator Based Algorithms 

We say that an algorithm for the center is mediator based if it uses only an optimal facility loca¬ 
tion for each mediator (but never uses any other information regarding the positions of the agents 
themselves). We show that for mediator based algorithms, randomization does not improve perfor¬ 
mance, as any such randomized a-competitive algorithm can be transformed to a mediator based 
deterministic algorithm with the same competitive ratio (moreover, the resulting algorithm per¬ 
forms at least as good on every single input). To state this result we first need the following lemma 
whose proof appears in Appendix lAl 

Lemma 2.4. For any tree and any distribution over points F, there exists a point p(F) such that 
for any finite multi-set S of points: 

^ dist (p , ,g) ■ f(q,S) >^dist (p(F),q) ■ f(q,S) . 

q£S q£S 

Moreover, for the euclidian metric on [ a,b] (for arbitrary a and b) the expected location according 
to F can serve as such a point p(F). 

Note that p(F) does not depend on S, and the same p(F) works for every S. A randomized 
algorithm maps the locations of all agents to a distribution over locations F. By the above lemma 
the deterministic algorithm that instead locates the facility deterministically at p(F) can only 
improve the social cost for every input. Thus, we have the following corollary. 

Corollary 2.5. Given any a-competitive mediator based randomized algorithm, it is possible to 
construct a mediator based deterministic algorithm with the same competitive ratio a. 

Note that the above transformation does not maintain incentives, indeed we show below that 
there exists a mediator based randomized two-sided IC mechanism which is 2-competitive, but no 
mediator based deterministic two-sided IC mechanism achieves this competitive ratio. Thus, al¬ 
though randomization does not improve performance for mediator-based algorithms, it does improve 
performance for mediator-based two-sided IC mechanisms. 

3 Deterministic Two-Sided IC Mechanisms 

In this section we extend the deterministic “median of medians” mechanism of mm from lines 
to trees and show that the resulting mechanism, which we call the Weighted Median Mechanism 
(WMM), is a deterministic two-sided IC mechanism. This mechanism is also 3-competitive, which is 
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essentially tight by a lower bound of [3} [12] (given for completeness as Theorem l3.3l) . The mechanism 
essentially elicits from each mediator an optimal location from the mediator’s perspective (median 
of the mediator’s agents), and then picks a weighted median of these locations. To create the right 
incentives for the agents and mediators, tie breaking must be handled carefully in both steps of 
the mechanism. By breaking ties in a way that is independent of the players’ reports, we make 
sure the players have no incentive to manipulate. The basic idea is that in each step we break ties 
in favor of the point closest to an arbitrary predetermined point. To formally describe WMM we 
need the following observation which proves that the above tie breaking rule is well defined, i.e., 
whenever the mechanism has to decide between a set of points, there is always a unique point in 
the set which is closest to the arbitrary predetermined point. 

Observation 3.1. Given a non-empty finite multi-set S of weighted points and an arbitrary point 
z, the set M of weighted medians of S contains a unique point p closest to z. 

Proof. The proof of this observation consists of several steps. First, showing that it is enough to 
consider a finite set R of weighted medians because for every weighted median p either p £ R or 
there is a weighted median r(p) £ R which is closer to z than p. Second, restricting ourselves to the 
set R' of weighted medians from R that are closest to z (i.e., as close to z as any other weighted 
median of R). Finally, proving that R' is of size 1, which completes the proof of the theorem. 

Given a point p £ M, let us denote by r(p) the first point of V U {z} U {// | 3 x (p', x) £ 5} along 
the path from p to z (there is such a point because z is always on this path and the set is finite). 
Let us prove that r(p) is also a weighted median of S (i.e., r(p) £ M ). Consider an arbitrary 
sub multi-set S' of weighted points from S which are connected via paths that do not go through 
r(p). If p £ V U {z} U {// | 3 x (p',x) £ 5}, then r(p) = p, and therefore, the weighted points of 
S' are connected via paths that do not go through p as well. Otherwise, every path connecting 
two weighted points of S which goes through p must go through r(p) as well. Hence, once again, 
the weighted points of S' are connected via paths that do not go through p. In both cases, we get 
w(S') < w(S)/2 because p is a weighted median of S, which completes the proof that r(p) £ M. 

Let R = {r(p) \ p £ M} and R' = {p £ R | dist(p, z) = min ge ^dist(g, z)} (the minimum in the 
definition of R' always exists because R is a finite set). Observe that R' C R C M, and for every 
point p £ M either p = r(p) or dist(p, z) > dist(r(p), z). Hence, every point in R' is closer to z than 
any point of M \ R. Moreover, all the points in R' have the same distance from z. To complete the 
proof we have to show that \R'\ = 1. Clearly, R' 0, so it is enough to show \R'\ < 1. Assume 
towards a contradiction that there are two points p q in R'. Let y be the first point where the 
paths from p and q to z meet. Since dist(p, z) = dist(g, z), y cannot be either p or q, and therefore, 
dist(y, z) < dist(p, z) = dist(g, z). Observe that y is located along the path connecting p and q, and 
therefore, we can partition S into three sub multi-sets whose union is S: 

• Si - Contains weighted points of S for which the path connecting them to y goes through p. 

• S 2 - Contains weighted points of S for which the path connecting them to y goes through q. 

• S 3 - The rest of the weighted points of S (the path connecting them to y does goes through 
neither p nor q). 

All the weighted points of Si U S 3 are connected by paths that do not go through q, and therefore, 
w(S± U S 3 ) < w(S)/2 because q is a weighted median of S. Thus, w(S 2 ) = w(S) — w(S\ U S 3 ) > 
w(S)/ 2. Using a similar argument we can also get w(S 1 ) > w(S)/2. As Si U S 2 C S, the two 
results can hold at the same time only if w(S\) = w(S 2 ) = w(S )/2 and w(Ss) = 0. 

Consider now an arbitrary sub multi-set S' of weighted points of S that are connected by 
paths that do not go through y. Clearly, either S' C Si U S 3 or S' C S 2 U S 3 . In the first case 
w(S') < w(S\ U S 3 ) = w(S)/2, and in the second case w(S') < w(S 2 U S 3 ) = w(S)/2. Hence, 
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in both cases w(S') < w(S)/2 , which implies that y € M. However, this is a contradiction since 
dist(y, z) < dist(p, z) = dist(g, z). □ 


The Weighted Median Mechanism (WMM) is a direct revelation mechanism in which the center 
does the following: 

• For each mediator di it computes £i which is the median of the multi-set {tC 11 < j < n*} 
closest to Zi, where t[ j is the location reported by di for agent a^j and Zi is an arbitrary point 
chosen independently of the reports received from the mediators (such a median exists, and 
is unique, by Observation 13.111 . 

• Let M be the set of weighted medians of the multi-set {{£i,nf )|1 < i < k}. Locate the facility 
at the point of M closest to z, where 2 : is an arbitrary point chosen independently of the 
reports received from the mediators. 

Note that this direct revelation mechanism can also be executed with much less communication 
since the only information the center needs from each mediator is a single point (the location of 
the median closest to some arbitrary point), and not the location of every agent represented by the 
mediator. Thus, the center can ask each mediator di to report a single location £i, and locate the 
facility at the weighted median of the multi-set {(£i,ni) |1 < i < k} closest to some point 2 : picked 
in advance. Observe that this algorithm for the center is mediator based. The resulting mechanism 
clearly achieves the same competitiveness as the direct revelation mechanism when each mediator 
di indeed reports a median location £i of his agents closets to an arbitrary point Zi since the location 
is picked using exactly the same method. Moreover, this mechanisms is also two-sided IC since 
the space of possible deviations for the mediators in this mechanism is more restricted than the 
corresponding space in the direct revelation mechanism. 

On line metrics WMM is essentially identical to a mechanism already known to be 3-competitive 
and mediator-side IC by an observation of m based on a result of [3j. We present a proof that 
WMM is 3-competitive and mediator-side IC not only for lines, but also for trees. We also show 
that it is agent-side IC. The next theorem summarizes the proved properties of WMM. 

Theorem 3.2. For any tree metric, the Weighted Median Mechanism is a deterministic two-sided 
IC mechanism with a competitive ratio of 3. 

The proof of Theorem 13.21 can be found in Section T3.11 The following theorem proved by [3J H2] 
shows that WMM has an optimal competitive ratio. For completeness, we give a proof of this result 
in Section [5j 

Theorem 3.3. Fix any constant e > 0. Then, no direct revelation deterministic mechanism that 
is mediator-side incentive compatible has a competitive ratio of 3 — e, even for line metrics. 

3.1 Analysis of WMM 

In this section we analyze WMM and prove Theorem 13.21 We begin with the following auxiliary 
observation. This observation shows that, given a fixed arbitrary point z, if the weighted median 
of a multi-set S closest to z changes when some weight moves from the point p to the point p', then 
this weighted median must be located along the path connecting p and p', and moves along this 
path in the direction from p to p'. 

Observation 3.4. Consider: 

• Two multi-sets of weighted points S and S' differing in a single weighted point, i.e., S' = 
S \ {p} U {p'} for a pair of weighted points p £ S and p' having the same weight (i.e., 
w(p) = w{p')). 


10 


• An arbitrary point z. 

• The weighted medians q (q ! ) of S (S') closest to z. 

If q 7 ^ q' then the path connecting p and p' goes through both q and q', and q is closer to p than q'. 

Proof. Assume q q' ■ If q' is also a weighted median of S than q must be closer to z than q'. 
Similarly, if q is also a weighted median of S' than q' must be closer to z than q. As it is not 
possible that both q is closer to z than q', and q' is closer to z than q, it must be the case that 
either q is not a weighted median of S', or q' is not a weighted median of S. We assume without 
loss of generality that first option is true. Assume towards a contradiction that q is not on the 
path from p to p'. Let P' be a sub multi-set of weighted points from S' that can be connected via 
paths that do not contain q. If f{p', P') < f{p', S') then w(P’) < w(S)/2 = w(S')/2 because q is a 
weighted median of S. Otherwise, consider the multi-set P = P' \ {p'} U {p}. By our assumption 
that the path between p and p' does not contain q, we get that P is a sub multi-set of weighted 
points from S connect by paths that do not include q. Hence, w(P') = w(P) < w(S)/2 = w(S')/2. 
In conclusion, we got that the weight of P' is always at most w(S')/ 2, and therefore, q is a weighted 
median of S', which contradicts our assumption. Hence, we can assume from now on that q is on 
the path between p and p'. 

Since q is not a weighted median of S', there exists a sub multi-set S" of weighted points from 
S' that are connected via paths that do not go through q and obeys w(S") > w(S')/ 2 . Since q is a 
weighted median of S, we must have: f{p',S") = f(p',S'). Assume now, towards a contradiction, 
that q' is on the path between p and q. This assumption implies that q is between q' and p', and 
therefore, the points of S" are connected by paths that do not go through q'. However, since q' 
is a weighted median of S", this implies w(S") < w{S')/2, which is a contraction. Hence, we can 
assume from now on that q' is not on the path between p and q. 

Next, assume towards a contradiction, that q' is not on the path between q and p'. Let t be 
the point on the path between q and p' closest to q'. Consider an arbitrary sub multi-set P of 
weighted points from S' connected via paths that do not go through t. If the points of P can 
be connected via paths that do not go through q ', then w(P) < w(S')/2 because q' is a weighted 
median of S'. Otherwise, at least one weighted point of P is separated from t by q', and therefore, 
p' qL P. Moreover, all the paths between weighted points of P do not go through q, and since q is 
a weighted median of S, we get: w(P) < w(S)/2 = w(S')/2. In conclusion, every sub multi-set P 
of weighted points from S' connected via paths that do not go through t has a weight of at most 
w(S')/2, and therefore, t is also a weighted median of S'. By definition q' is the weighted median 
of S' closest to z, and therefore, q’ is closer to 2 than t. 

From the last conclusion we learn that the path from q to z goes through t and q'. Hence, q' 
is not a weighted median of S. Thus, there exists a sub multi-set P of weighted points from S 
that are connected via paths that do not go through q', and obeys w(P) > w(S)/2. Since q' is a 
weighted median of S', we must have f(p, P) = f(p, S ). Consider the set P' = P\{p} U {p'}. Since 
p and p' are connect by a path that does not go through q', the points of P' are also connected by 
paths that do not go through q'. On the other hand, q' is a weighted median of S', and P' C S', 
and therefore, w{P) = w(P') < w(S')/2 = w(S)/2, which is a contradiction. Hence, q' must be on 
the path connecting q and p', which completes the proof of the observation. □ 

Using the above observation we can now show that WMM is two-sided IC. The agent-side IC 
of WMM is an easy new result, while the mediator-side IC is known from is m for lines, and we 
extend it to trees. 

Lemma 3.5. The Weighted Median Mechanism is agent-side IC. 
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Proof. Consider an arbitrary agent dij. We would like to show that given that di is truthful, it 
is non-beneficial for aij to report a fake location t[ ■ instead of its true location tij. Let and l[ 
denote the median of the reports of agents {^ 1 , 0 ^ 2 , • • • ,ai, ni } when a iyj reports its true location 
ti and the fake location tf respectively. If li = If then reporting the fake location t\- does not 
affect the location picked by the center, and therefore, is non-beneficial for aij. Otherwise, by 
Observation 13.41 li is on the path between tij and If. 

Let F and F' be the locations that the center picks for the facility assuming a,;j reports a location 
of tij or t[ j to her mediator, respectively. If F = Ff then reporting the fake location tfj is, again, 
non-beneficial for aij. Otherwise, by Observation 13.41 F and F' are both on the path between li 
and If and F is closer to li than F'. Combining both results, we get that the path from tij to l[ 
goes first through li, then through F and finally through F'. Thus, dist (tij,F) < dist(tjj, F'), and 
it is strictly non-beneficial for a ? ;j to report the fake location t\ ■. □ 

Lemma 3.6. The Weighted Median Mechanism is mediator-side IC. 

Proof. Consider an arbitrary mediator d{. We should prove that it is always a best response for 
di to be truthful given that the agents of Ai are truthful. Let li be the median of the locations 
reported by the agents to di, that is closest to z % . Since the agents of Ai are truthful, l t is also 
the median of the true locations of the agents of Ai that is closest to z*. Every strategy of di can 
be characterized by the median closest to Zi of the locations reported by di on behalf of his agents 
because this is the only information used by the center. Hence, we can characterize any deviation of 
di from truthfulness by the median l! i of the locations it reports that is closest to z*. Our objective 
is to prove that a deviation represented by l\ is no better than being truthful. 

First, if li = If then the location picked by the center is unaffected by the deviation, and 
therefore, the deviation is non-beneficial. Thus, we can assume from now on li 7 ^ If Next, let 
F and F' be the locations picked by the center given that di reports locations whose medians 
(picked by the center according to the tie breaking rule) are li and If respectively. If F = F' then 
the deviation is clearly non-beneficial. Otherwise, by Observation 13.41 the path from li to l\ goes 
through F and F' in that order. 

For every x G [0,1], let p x be the point along the path between l t and l\ obeying dist (li,p x ) = 
x ■ dist (li,lf), and let c x be the social cost of the agents of Ai if the facility is located at p x . Let p 
be an arbitrary point, and let us observe the change in dist (p x ,p) as x increases. Clearly, dist (p x ,p) 
initially decreases at a constant rate and then starts to increase again in a constant rate (possibly 
the decrease part or increase part is of zero length). Hence, the derivative of c x is a non-decreasing 
function in x (except for a finite number of x values for which it might not be defined). For x = 0, 
p x is the median of the locations of the agents of di, and therefore, c x is minimized for x = 0. 
Thus, the derivative of c x is non-negative for every x > 0, whenever it is defined. Let y and y' be 
the values of x for which F = p y and F' = p y <. Then, we have y < y', and therefore, c y < c y >, 
i.e., locating the facility at F' induces no better social cost for the agents of Ai in comparison to 
locating it at F. Hence, reporting a set of locations whose median is l[ instead of a truthful report 
(which makes li the median closest to zf) is non-beneficial from df s perspective. □ 

To complete the proof of Theorem 13.21 we still need to show that WMM is 3-competitive. 
This was already observed by mm for lines, but not for trees. In order to prove this result for 
trees, we observe that WMM is a restriction of a more general mechanism called IWMM which we 
analyze in Section [ 6 l The competitive ratio of WMM follows immediately from the result proved 
by Proposition 16.21 for IWMM. 
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4 Randomized Two-Sided IC Mechanisms 


It is known by mm that there is no deterministic Two-Sided IC Mechanism that is better than 3- 
competitive (this result also appears as Theorem 13.31) . In this section we show that we can improve 
and achieve a competitive ratio of 2 by switching to randomized mechanisms, and that this is the 
best ratio that can be achieved. 

To simplify the exposition of our mechanism, we first describe it for the simple case of line 
metrics (he., for the case where the tree T is simply an interval). A line metric is the Euclidean 
metric of an arbitrary interval [a, b] (where a < b are real numbers). Notice that a point in the 
metric is simply a real number from the interval [a, b\. The Two Percentiles Range Mechanism 
(TPRM) is a direct revelation mechanism in which the center runs the following algorithm: 

• For each mediator di compute the median li of the multi-set {fT11 < j < n«} that is closest 
to Zi, where t\ ■ is the location reported by di for agent ctjj and Zi is an arbitrary point chosen 
independently of the reports received from the agents. 

• Consider the multi-set S of points, created by adding £i to the multi-set n* times, for each i. 
Let Ui denote the i-th element of this multi-set when sorted in any non-decreasing order. 

• Randomly choose a location for the facility from the list: 'fRnMj+i, ^|n/ 4 J+ 2 > • • •, u\:s n /A\, where 

the probability of each value Ui in this list is {n/2 ) , except for the first and last values 

(' U |n/ 4 I + 1 an< ^ «r 3 n/ 4 l), which have a probability of (1 — r/4)/(n/2) where r is the reminder 
of dividing n by 4. 

Like in the deterministic case, Observation P ensures that is well defined for every i. Also 
similarly to the deterministic case, this direct revelation mechanism can also be executed with 
much less communication by only asking each mediator to report a single point (the location of the 
median closest to some arbitrary point) and running a mediator based algorithm that corresponds 
to the two final steps of TPRM on the reports. The resulting mechanism will have the same 
properties (competitiveness, incentives) as the direct revelation mechanism. 

Theorem 4.1. For any line metric, the Two Percentiles Range Mechanism (TPRM) is a random¬ 
ized two-sided IC mechanism with a competitive ratio of 2. 

The proof of Theorem 14.11 is deferred to Section 14.11 The intuition behind the improved com¬ 
petitive ratio of TPRM, as compared to WMM, is as follows. Consider a section s connecting the 
real locations of two agents separated by no other agent. 

• If s is located near the median of all the agents, then it is not very important on which side 
of it is the facility located, because either way s will contribute to the distance between the 
facility and about half the agents. 

• If s is located very far from the median, then both TPRM and WMM will locate the facility 
at the “right” side of s. 

• If s is not located near the median, nor very far from it, then it is important that the 
mechanism will locate the facility at the right side of s. WMM makes the wrong call for some 
inputs, and, being deterministic, when it makes the wrong call it makes it with probability 
1. On the other hand, it can be shown that the randomized TPRM always has a significant 
probability of making the right choice. 

Next, we present a mechanism that extends TPRM to general trees, called the Tree Randomized 
Mechanism (TRM). Like TPRM, TRM defines a distribution over the medians of the sets reported 
by the mediators, and then chooses the facility location randomly according to this distribution. 
The distribution is carefully picked to achieve the best possible competitive ratio of 2. More 
specifically, the mass of the distribution is placed on the “central parts” of the tree, which is 
analogous to behavior of TPRM. Formally, TRM is a direct revelation two-sided IC mechanism 
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that is also 2-competitive. The description of TRM consists of two parts. The first part of TRM 
determines the medians of the sets reported by the mediators, and then splits the edges of T at 
these points, which allows us to assume that all medians are vertexes of T. Additionally, this part 
finds a point r which is considered to be the “center” of T. If necessary, an edge is split to make r 
a vertex of T, and then T is rooted at r. 

TRM - Part 1 

1. Fix an arbitrary point z independently of the reports of the mediators. 

2. Define size(p) = n* for every point p. where li is the median closest to z of the points 

reported by d{. 

3. Let L = {(p, size(p)) | p is a point and size(p) > 0}. 

4. Let r be the weighted median of L closest to z. 

5. For every point p € {r} U {li | 1 < i < k] do: 

6 . If p is an internal point of an edge e, then: 

7. Split e at point p to two edges. 

8 . Root T at r. 

Clearly we can assume after the first part of TRM that {r} U {li | 1 < i < k} C V. Since T is 
now rooted, we can define additional notation that is used to describe the second part of TRM. 

Definition 4.1. 

• children (n) - The set of children nodes of u in the tree. 

• subtree(n) - The set of nodes in the subtree of node u, including u itself. More formally, 
subtree(n) = {u} U [U u , echildren(u) subtree^')]- 

• treesize(n) - The number of agents represented by a mediator d% for which li € subtree(n). 
More formally, treesize(n) = size(n) + X^ u 'e c hiidren(u) treesize(u')- 

The second part of TRM defines the probability distribution used to select the facility location. 
Informally, this distribution has the following property: the probability that the facility location is 
in the subtree of a (non-root) vertex u is proportional to max{treesize(n) — n/4,0}. An alternative 
view of this distribution is that the algorithm preprocess the agents by discarding some agents of 
peripheral vertexes, and “promoting” other agents of these vertexes to be associated with vertexes 
which are closer to the root. After this preprocessing, the algorithm selects a uniformly random 
remaining agent, and locates the facility at the location desired by the mediator associated with 
the selected agent. The preprocessing is done in a way that guarantees the following: 

• If only a few agents originally belong to the subtree of a vertex u (i.e., treesize(n) < n/4), 
then all these agents are either discarded or promoted to some ancestor vertex of u, and thus, 
u never becomes the facility location. 

• If many agents originally belong to the subtree of a vertex u, then the preprocessing reduces 
the number of agents associated with this subtree by n/4 (or n/2 if u is the root). 

TRM - Part 2 

1. Let X = {u € V | treesize(n) > n/4}. 

2. Let c(u ) denote the value n/4 for every node u G X — {r} and n/2 for u = r. 

3. Pick every node u € X as the facility location with7 probability: 

, x treesize(u) — c(u) S u 'exnchiidren(u)( treesize ( u/ ) - c ( u ')) 

PM = --T2- ' 

For consistency, define p(u) = 0 for every point u (f X. 
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Figure 1: Example of a probability distribution induced by TRM. 

Note that this direct revelation mechanism can also be viewed (and executed) as a mediator 
based algorithm as the center only needs from each mediator d% a location li which is optimal from 
dfs perspective. The proof of the next theorem is deferred to Appendix [Bj 

Theorem 4.2. For any tree metric, the Tree Randomized Mechanism (TRM) is a randomized 
two-sided IC mechanism with a competitive ratio of 2. 

Let us get a better understanding of TRM by considering an example input (given as Figured]) 
and explaining the probability distribution induced by TRM. The figure depicts the top 7 nodes of 
an example tree T that can be outputted by the first part of TRM. The number inside each node 
represents the portion of the agents population represented by mediators whose (sole) median is 
this node. For example, the number 0.24 appears inside the root node R, hence, in our example, 
0.24 • n agents are represented by mediators whose median is R. Some of the nodes in the figure 
have a triangle shape dangling from them. Each triangle represents the subtree of the node it is 
dangling from, and the number written inside it represents the portion of the agents represented by 
mediators whose (sole) median is inside the subtree (but is not the root of the subtree). For example, 
inside the rightmost triangle we have the label 0.25. Hence there are 0.25 • n agents represented 
by mediators whose median is inside the subtree of D, but is not D itself. Finally, outside of each 
node there is an additional number (in a box). This number represents the probability that TRM 
selects this node as the facility location. 

Following is a short explanation of how (some of) the probabilities in Figure [Tj were calculated. 
We say that a node (subtree) in Figured] has a weight of x if x agents are represented by mediators 
whose median is the node (a node in the subtree). Observe that TRM selects R to be the root 
node r since its left subtree has a weight of only 0.36 • n < nj 2, and its right subtree has a weight 
of only 0.4 • n < n/2. Next, observe that no triangle in the figure represents a subtree with weight 
of more than 0.25, and therefore, no node in the subtrees represented by these rectangles has a 
positive probability to be the location of the facility. Consider now a few of the nodes of Figured] 

• The subtree rooted at A (which contains A alone) has a weight of only 0.1 • n < 0.25 • n, and 
therefore, A ^ X and thus has 0 probability to be the facility location. 

• The subtree rooted at B has a weight of 0.26 • n > 0.25 • n, and no other node in this 
subtree has a positive probability to be the facility location. Therefore, B has a probability 
of 2(0.26 — c(B )) = 2(0.26 — 0.25) = 0.02 to be the facility location. 

• The subtree rooted at E has a weight of 0.36 • n > 0.25 • n. However, E has a single child 
B associated with a positive probability, and the subtree rooted at B has a weight of 0.26. 
Therefore, E has a probability of 2(0.36 — c(E) — (0.26 — c(B))) = 0.2 to be the facility 
location. 

• The subtree rooted at R has a weight of 1 • n. However, R has two children E and F, each 
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associated with a positive probability by the algorithm, whose subtrees have weights of 0.36 
and 0.4, respectively. Therefore, R has a probability of 2(1 — c(R ) — (0.36 — c(E)) — (0.4 — 
c(F))) = 0.48 to be the facility location. 

The following theorem shows that TRM and TPRM have optimal competitive ratios, its proof 
is deferred to Section [5j 

Theorem 4.3. Fix any constant s > 0. Then, no direct revelation randomized mechanism that 
is mediator-side incentive compatible and no mediator based algorithm has a competitive ratio of 
2 — e, even for line metrics. 


4.1 Analysis of TPRM 

In this subsection we analyze TPRM and prove Theorem 14.11 We begin the proof by establishing 
some notation. Let us sort the agents according to their location (breaking ties arbitrarily), let 
bi be the i th agent in this order and let U be the location of b t . We say that agent bi is to the 
left (right) of agent bj if i < j (i > j). Let s t denote the segment connecting bi and 6j + 1 , and let 
Xi = ti -|_i — ti be the length of s*. We say that segment Si is to the left (right) of agent b if bi (fti+i) 
is to the left (right) of b. Given a location for the facility, we say that an agent bi uses segment Si 
if Si is on the path between the agent and the given location. 

Before analyzing TPRM, let us make sure it is well defined. 


Observation 4.4. Let pi be the probability that TPRM picks m as the location of the facility. 
Then, Y11=iPi = *• 

Proof. The number of elements in the list u |n/ 4 J+ 3 i ■ ■ ■ ; u [ 3 n/ 4 l i s: 

|"3n/4] — [n/4j = (3n/4 + r/4) — (n/4 — r/4) = n/2 + r/2 . 

Hence, n/2+r/2—2 elements of the list have a probability of (n/2) _1 , and two have a probability 
of (1 — r/4)/(n/2). Summing up all these probabilities, we get: 




n/2 + r/2 — 2 1 — ?’/4 


n /2 


+ 2 - 


n /2 


= 1 


□ 


4.1.1 Proof: TPRM is 2-Competitive 

First we analyze the competitive ratio of TPRM as an algorithm (when players are truthful). The 
following lemma characterizes the social cost of the optimal facility location. This characterization 
is used later to prove the competitive ratio of TPRM. 

Lemma 4.5. The social cost of the optimal facility location is: 

L(n-1)/2J n—1 

£ i-Xi+ Y, (n-i)-*i- (!) 

i=l i=L(re+l)/2j 

Proof. By Observation 12.21 t\ n / 2 \ is an optimal facility location. Let us evaluate the social cost of 
placing the facility at this location. Consider segment Sj. If 1 < i < n/2, then Sj is to the left of 
b\ n / 2 ], and therefore, i agents use sf b\, 62 ,..., bi. Thus, the contribution of s* to the cost of the 
optimal solution is i ■ Xi. On the other hand, if n — 1 > i > n/2 (i.e ., Sj is to the right of b^ n j 2 ]), 
then n — i agents use sf 6*+i, &«+ 2 , • • •, b n . Thus, the contribution of Sj to the cost of the optimal 
solution is (n — i) ■ Xi. Adding up all the above contributions, we get Expression ([T]). □ 
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Pick an arbitrary segment Si = |_i] (recall that ti is the location of agent bi), and let Lj 

and R{ denote the number of Uj values that are smaller than ti +1 and larger then ti, respectively. 
Intuitively, L, and Ri denote the number of agents represented by a mediator dh for which lh is to 
the left and right of Sj, respectively. 

Lemma 4.6. For every i, Li < 2i and Ri < 2(n — i). 

Proof. We prove Li < 2 v, the proof that Ri < 2 (n — i) is symmetrical. Let Di = {c4 | ^ < tj+i}. 
Notice that a mediator dh contributes to Li if and only if £h < L + \ (he., lh is to the left of +), in 
which case his contribution to Li is n^. Hence, Li = Y/d h &Di n h- 

Consider now an arbitrary mediator dh £ Di- At least half the agents of dh must have locations 
smaller or equal to ih < U+i, and therefore must be of the form bj for j < i. Hence, the size of the 
set Bi = {bj | j < i A BheDibj € Ah} is at least ^2d h eD n h /2 = Li/ 2. The lemma now follows by 
observing that B cannot contain more than i agents. □ 

We are now ready to prove the approximation ratio of TPRM. 

Lemma 4.7. The Two Percentiles Range Mechanism is 2-competitive. 

Proof. Let c* be the coefficient of Xi in Expression (|T|) . he.. & = i for i < n/2, and c* = n — i for 
i > n/2. To prove that TPRM is a 2-competitive mechanism, it is enough to show that, given the 
facility location selected by TPRM, the expected number of agents using segment Sj is at most 2 q. 

Consider an arbitrary section Sj, and assume i < n/2 (the case i > n/2 is symmetric). Let 
Pi be the probability that the facility location chosen by TPRM is to the left of Sj. If the chosen 
facility location is indeed to the left of Sj, then all the agents to the right of Sj will use it, he., n — i 
agents. On the other hand, if s* is to the right of the chosen facility location, than only i agents 
will use it. Hence, the expected number of agents using s* is: 


Pi ■ (n - i) + (1 - Pi) ■ i = Pi ■ [n — 2i) + i . 


The ratio between this expectation and c* = i is: 


Pi ■ (y - 2 ) + 1 . ( 2 ) 

Proving the lemma now boils down to proving that 0 is always upper bounded by 2. Notice that 
the coefficient of Pi is always non-negative (recall that we assumed i < n/ 2 ), and therefore, any 
upper bound on Pi will translate into an upper bound on 0 . 

By Lemma F4.61 Li < 2i, which implies, by definition, that U 2 i+i > U +1 (he., U 2 i+i is to the 
right of Si). Hence, the facility location picked can be to the left of s* only if the center picks one 
of the values: ^InMI+i,' u |n/4J+2, ■ ■ ■, U 2 i- Thus, we can upper bound P* by: 


Pi < max 



[2i-(Ln/4j +!)] + (! 
n/2 



f 2i — n 41 f 4i 

max < 0 ,- 7 -> = max < 0 , — 

( n/2 J { n 



If ^ ^ < 0, than Pi = 0, and the ratio 0 becomes 1. Hence, we only need to consider the 

case that ^ \ > 0. Using this assumption, the ratio 0 can be upper bounded by: 


n 


Pi- --2+l< --- 


4 i 


n 



2 ) + 1 = 4 - 


8 i 
n 


n 8 i 

+ 1 + 1 = 6 -- 
2 i n 


n 

2 d 


It can be easily checked that the last expression is maximized for i 
value is exactly 2 . 


n/4, and its maximum 

□ 
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4.1.2 Proof: TPRM is Two-Sided IC 


Next, we prove that TPRM is two-sided incentive compatible. For that purpose we need the 
following observation. 

Observation 4.8. Fix the reports of all mediators except for di. Let us denote by u\,U 2 , • • •, u n 
and u\, u ' 2 .... ,u' n the set {/iq, } r f = x assuming mediator di reports a set of locations whose median 
is ii and t[, respectively, where t\ > ti (t\ < ti). Then u' h > Uh (u' h < Uh) for every 1 < h < n. 
Moreover, if Uh < ti (uh > ti) then Uh = u' h . 

Proof. We prove the case of t[> ti. The other case is symmetric. Assume the median of the reports 
of di is ti , and let us analyze the effect on the list u\, U 2 , ■ ■ ■, u n when this median changes to t[. 
Following the change some Uh values that are equal to ti increase to t[, and possibly move to later 
places in the list. From this description, it is clear that Uh < u' h for every 1 < h < n. Moreover, 
the part of the list containing values smaller than ti is unaffect, and therefore, Uh = u' h whenever 
u h < ti. □ 

Lemma 4.9. The Two Percentiles Range Mechanism is agent-side IC. 

Proof. Consider an arbitrary agent a l ,j. We would like to show that, given that di is truthful, it 
is non-beneficial for aij to report a fake location t\ ■ instead of its true location tij. Let ti and t\ 
denote the median of the reports (breaking ties towards zf) of mediator di assuming aij reports its 
true location ti and the fake location t\, respectively. If ti = t\, then reporting the fake location t\ ■ 
is certainly non-beneficial for a ?J . Otherwise, assume without loss of generality that ti < t' % . By 
Observation 13.41 ti is on the path between tij and tf-, i.e., tij < ti. 

Let U\,U 2 , ■ ■ ■ ,u n and u\ ,u' 2 , ..., u' n denote the u^s before and after the deviation of a^j . By 
Observation 14.81 u' h > it/, for every 1 < h < n and Uh = u' h when Uh < ti. Hence, dist (uh,Uj) > 
dist (u' h ,tij) for every 1 < h < n because tij < ti. As TPRM selects Uh and u' h as the facility 
location with the same probability, we get that the expected distance of ti j from the facility cannot 
decrease following the deviation of a t ,j. □ 

Lemma 4.10. The Two Percentiles Range Mechanism is mediator-side IC. 

Proof. Consider an arbitrary mediator di. We should prove that it is always a best response for 
di to be truthful given that the agents of A* are truthful. Let C be the median of the locations 
reported by the agents to di, that is closest to Zi- Since the agents of Ai are truthful, t t is also 
the median of the true locations of the agents of Ai that is closest to Zj. Every strategy of di can 
be characterized by the median closest to z* of the locations reported by di on behalf of his agents 
because this is the only information used by the center. Hence, we can characterize any deviation of 
di from truthfulness by the median tl i of the locations it reports that is closest to Zj. Our objective 
is to prove that a deviation represented by i\ is no better than being truthful. 

Assume, without loss of generality, t % < t\. Let Uh and u' h be the value of the h th element in the 
list u\, U 2 , ■ ■ ■, u n , assuming di is truthful or chooses a deviation represented by t[. If Uh = u' h , then 
di clearly does not care whether the facility is located at Uh or u' h . Otherwise, by Observation 14.81 
u' h > u h > ti. 

Let p x = ti + x , and let c x be the social cost of the agents represented by di if the facility 
is located at p x . Let p be an arbitrary point, and let us observe the change in dist(p x ,p) as x 
increases. Clearly, dist^,/?) initially decreases at a constant rate and then starts to increase again 
in a constant rate (possibly the decrease or increase part is of zero length). Hence, the derivative 
of c x is a non-decreasing function in x (expect for a finite number of x values for which it might 
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not be defined). For x = 0, p x is a median of the locations of the agents of di, and therefore, c x 
is minimized for x = 0. Thus, the derivative of c x is non-negative for every x > 0, whenever it is 
defined. Let y and y' be the values of x for which Uh = p y and u' h = p y i. Then, we have y < y', 
and therefore, c y < c y >, i.e., locating the facility at u' h induces no better social cost for the agents 
of Ai in comparison to locating it at Uh- Hence, mediator di does not prefer that the facility will 
be located at u' h over locating it at Uh- 

Consider the following process. We start with u\, U 2 , ■ ■ ■, u n , and at each step pick some Uh and 
replace it with u' h (i.e., the probability mass that was given to Uh moves to u' h ). Notice two things. 
First, Uh and u' h have the same probability to be selected by the center (since they have the same 
index), and therefore, no step improves the social cost of the agents represented by di. Second, 
the process described starts with the configuration resulting from a truthful di, and ends with the 
configuration resulting of a deviation represented by l(. Hence, this deviation is non-beneficial for 
di. □ 

5 Lower Bounds on Two-Sided IC Mechanisms 

In this section we prove the lower bounds given in Sections [3] and 0] on the competitive ratios that 
two-sided incentive compatible mechanisms can achieve on any tree metric. All our lower bounds 
are based on a construction for the line metric on [0,1] (i.e., on the simple tree T that is just 
an interval) which is given as Example [TJ The lower bounds exploit the inability of mediator- 
side IC mechanisms to use information other than the location of the optimal point (median) for 
each mediator, due to the mediator incentive constraints. Example |T] is essentially identical to 
the construction used in b n However, while these papers only provide a lower bound for the 
deterministic case, we show that the same construction can also be used to prove a lower bound for 
the randomized case. Note that interval lower bounds automatically extend to any tree T, since 
the interval [0,1] can be mapped to any path connecting two leaves of the tree. 

Example 1. Fix r to be an arbitrary positive integer. For every pair of points h and l such that 
0 < l < h < 1 we define three instances l [' h , I I/' and I l f h . Each instance has two mediators d\ 
and d. 2 , with each mediator representing 2r + 1 agents. In If , mediator d± represents r + 1 agents 
located at l and r agents located at h, while d% represents 2r+l agents located at h. In I l f h , mediator 
d\ represents 2r + 1 agents located at l, while c ?2 represents r agents located at l, and r + 1 at h. 
Finally, in I l f h mediator d\ represents 2r + 1 agents located at l, while represents 2r + 1 agents 
located at h. 

To make use of Example HI we need to make the following observation. 

Observation 5 . 1 . In all instances l\’ h , I l f h and I l f h of ExampleUl each mediator represents agents 
with a unique median. Moreover, the unique median of the agents represented by d\ is l, while the 
unique median of the agents represented by c ?2 is h. The globally optimal location for l\’ h is to locate 
the facility at h, and has a cost of (h — l)(r + 1). Similarly, the globally optimal location for I l f h is 
to locate the facility at l, and its cost is also (h — l)(r + 1). 

Using the above observation, we can prove the following lemma. 

Lemma 5 . 2 . Consider the two instances I\ = Z^’ 1 and I 2 = Z 2’ 1 defined as in Example\]f For any 
mediator-side IC mechanism, the expected locatioi @ picked given either I\ or I 2 must be the same. 

10 Notice that a location in a line metric is just a number, hence, one can calculate expectation over locations. 
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Proof. Denote I 3 = /3’ 1 . We show that the expected location picked given either I\ or I 2 must be 
the same since in both cases the expected location picked is the expected location picked given I 3 . 
We assume throughout the proof that the agents are truthful. 

Let Xj be the expected location of the picked location given input Ij (for j £ {1,2,3}). For 
any facility location distribution F whose support is within [0,1], the expected cost for mediator 
d\ in instance I\ is r + E[x], while the expected cost for mediator d\ in instance I 3 is ( 2 r + l)E[cc], 
where E[x] is the expected location picked according to F. We observe that for both instances 
and any distribution, the cost of d± is linearly increasing in the expected location of the facility. 
This means that for both inputs, mediator d\ prefers a distribution with lower expected location. 
That is, if 0 < xi < X 3 then given I 3 mediator d\ prefers to pretend that the input is I\, and if 
0 < X 3 < x\ then given I\ mediator d\ prefers to pretend that the input is I 3 . We conclude that 
x\ = X3. Similar arguments show that X2 = X3, and thus, x± = X2- □ 

We are now ready to prove our lower bounds. 

Theorem I3.3L Fix any constant e > 0. Then, no direct revelation deterministic mechanism that 
is mediator-side incentive compatible has a competitive ratio of 3 — e, even for line metrics. 

Proof. Consider an arbitrary direct revelation deterministic mediator-side incentive compatible 
mechanism, and assume all players are truthful. To prove the theorem we fix r > 2e _1 , and 
consider two instances I\ = /j 1 ’ 1 and I 2 = 1 2’ 1 as defined in Example [U By Lemma 15.21 (with a 
single point distributions determined by the deterministic algorithm), the location picked given I\ 
must be the same as the location picked given I 2 . Let us denote this location by p. 

If p = 0 then for I\ the cost is 3r + 1 while the optimal cost is r + 1. The resulting competitive 
ratio is = 3 — > 3 — £ > 3 — e (as r > 2e~ 1 ). Similar arguments show that if p = 1 

then the competitive ratio is again worse than 3 — e for 12 - This completes the proof for the case 

p e { 0 , 1 }. 

Assume now p £ (0,1), and consider the location x' 2 £ [0,1] picked by the mechanism given the 
input I 2 = I 2 ’ p . If x ' 2 < p then given I 2 mediator prefers to pretend that the input is I 2 , and if 
x 2 > p then given input I 2 mediator d 2 prefers to pretend that the input is I 2 - We conclude that 
x 2 = p. 

The social cost of locating the facility at p given input P 2 is (3r + 1 )p, while the optimal social 
cost for this instance is (r + 1 )p. The competitive ratio achieved is, therefore, no more than: 
^r = 3-4 T >3-f >3-e (asr>2£- 1 ). □ 

The lower bound for the randomized case is very similar to the deterministic case, but it uses 
Lemma 12.41 which allows us to extend the result to randomized mechanisms. 

Theorem 14.31 Fix any constant e > 0. Then, no direct revelation randomized mechanism that 
is mediator-side incentive compatible and no mediator based algorithm has a competitive ratio of 
2 — e, even for line metrics. 

Proof. Consider an arbitrary direct revelation randomized mechanism which is mediator-side in¬ 
centive compatible, and assume all players are truthful. To prove the theorem we fix r > e _1 , 
and consider two instances I\ = I^’ 1 and I 2 = I 2 ’ 1 as defined in Example [1] By Lemma 15.21 the 
expected location for I\ must be the same as the expected location for I 2 . Let p be this expected 
location. 

By Lemma El for any set of agents, replacing a distribution whose expected location is x 3 by 
deterministically locating the facility at Xj only decreases the cost. This means that the performance 
of the mechanism is no better than the worse performance (over I\ and I 2 ) achieved by locating 
the facility deterministically at some p £ [0,1]. The cost of I\ when the facility is located at p is: 
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p(r + 1) + (1 — p){3r + 1), and the cost of 1-2 is p{3r + 1) + (1 — p)(r + 1). The maximum of these 
costs is: 

max{p(r + 1) + (1 — p)(3r + l),p(3r + 1) + (1 — p)(r + 1)} 

> \p( r + !) + (!-P)(3r + 1)] + [p(3r + !) + (!- p)(r + 1)] = 2r + 1 

which induces a competitive ratio of at least: = 2 — >2 — ^ > 2 — e (as r > e _1 ). 

This completes the proof for direct revelation randomized mechanisms that are mediator-side in¬ 
centive compatible. The theorem holds also for mediator based algorithms because such algorithms 
cannot distinguish between the inputs I®’ 1 and I®’ 1 . □ 

6 Multiple Levels of Mediation 

In this section we present results for the case of multiple levels of mediation. We can represent a 
hierarchy of mediators by a tree (note that this tree is not the same tree as the one defining the 
metric). The root of the tree represents the center, each internal node represents a mediator, and 
each leaf represents an agent. The tree is common knowledge. Let s be the depth of the tree, or 
the maximal number of edges between the root and a leaf. The case we have studied so far is thus 
represented by a tree with three levels (s = 2), the root represents the center, the internal level 
represents the mediators, and the leaves represents the agents. 

As before, a player is either an agent or a mediator. For each player, another player is a direct kin 
if that other player is either a descendent in the tree, or an ancestor in the tree. Recall that a direct 
revelation mechanism is a mechanism in which each agent reports her location, and each mediator 
reports all of the locations of the agents below him. In such a mechanism, we say that a player is 
truthful if she is an agent and she reports her location truthfully, or he is a mediator and he reports 
all of the locations of the agents below him truthfully (as received from his direct descendants). 
While our results from the previous sections show the existence of competitive mechanisms that 
are two-sided incentive compatible for trees with a single layer of mediators, the next theorem 
proves that even for a much weaker solution concept, ex-post incentive compatibility, competitive 
mechanisms with multiple layers of mediators are impossible. 

Definition 6.1. A direct revelation mechanism is ex-post incentive compatible if for every player, 
being truthful always maximizes the player’s utility, assuming that all other players are truthful. 

We note that this solution concept is weaker than being two-sided IC (or the natural gener¬ 
alization of this notion to more levels) as it does not require any player to ever have a dominant 
strategy in an induced game with only part of the other players (his direct kins) being truthful. 
Unfortunately, it is impossible to construct competitive ex-post IC mechanisms for s > 2. 

Theorem 6.1. No mechanism with a finite competitive ratio is ex-post IC for instances with s > 2. 

Proof. Consider the following instance on the metric interval [0, 2] with s = 3. 

• Level 3 contains the center. 

• Level 2 contains a single mediator C, reporting to the center. 

• Level 1 contains two mediators, which we will call A and B, both reporting to C. 

• Level 0 contains five agents: 

— Agents a,b and c are represented by mediator A, and located in 0,0 and 1, respectively. 
— Agents d and e are represented by mediator B, and are both located at 2. 
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Assume for the sake of contradiction that there exists a finite competitive ratio mechanism 
which is ex-post IC for the above instance. Observe that if the center gets a report stating that 
all five agents are located at a point p, then the mechanism must locate the facility at p to have 
a bounded competitive ratio. Assuming all players are truthful, except maybe for C. Then the 
following observations hold: 

• Let S be the multi-set of the positions of the agents as reported to C. Then, S represents 
the true locations of all the agents. 

• The center observes the report of C. 

Let m be S’s median. By Observation 12.21 m is the optimal facility location for C. If C deviates 
and reports that all agents are located at m, the center will have to locate the facility at m, due to 
the above discussion. Thus, for C to have no incentive to deviate, the center must always locate 
the facility at the median of the locations it gets. 

Consider now the situation that all players are truthful, except maybe for A. Let us considers 
A’s situation. A is reported, correctly, that his agents are located at 0,0 and 1. Thus, the optimal 
facility location for A is 0. If A reports truthfully, the center will get the reports 0,0,1,2 and 2, 
and, by the above discussion, will locate the facility at 1. On the other hand, if A deviates, and 
reports that all his agents are located at 0, the center will get the reports 0, 0,0, 2 and 2, and will 
locate the facility at 0 (which is better from A’s point of view). Thus, the mechanism considered 
is not ex-post IC. □ 

It seems that in any mechanism that satisfies “minimal” incentive properties, the only “useful” 
part of the information reported by a player to its ancestor is the optimal location (from the per¬ 
spective of that player) with respect to that player’s input. That is, the mechanism’s output cannot 
change if the locations reported by a player are all replaced by the optimal location of that player 
(with respect to that player’s input). This observation naturally suggests the following mechanism 
which generalizes the Weighted Median Mechanism. The mechanism iteratively computes weighted 
medians (breaking ties in a report-independent way) for each mediator, bottom up, and outputs 
the final location. Consider the example presented in the proof of Theorem 16.11 The suggested 
mechanism would behave as if A reports that all his agents are located at 0, and C reports that 
all his agents are located at 0 (as this is the median of his input which is 0,0,0,2,2). The final 
outcome would be to locate the facility at 0. Notice that this outcome, while not optimal, is not 
too far from optimality. 

This raises few questions which we address next. First, assuming that all players are truthful, 
what is the approximation that this mechanism achieves? Second, what kind of incentive property 
does this mechanism satisfy? And finally, for this incentive property, is there any other mechanism 
that is substantially better? 

We begin by establishing some notation. Let djj be the j th node of the i th level in the tree, 
where the level of the leaves that are furthest away from the root is i = 0 (notice that is the 
root). For every 0 < i < s, let mi denote the number of nodes appearing in level i of the tree. 
For every mediator dij we denote by Qj the set of children of dij in the tree and by A t ^ the set 
of leaves (agents) that descent from d t ^. For consistency, if d l; j is an agent, then Cjj = 0 and 
Aij = {di.j }. Finally, for every agent a, let t a denote the location of a. 

We can now formally define the mechanism. We note that this is not a direct revelation 
mechanism, but rather, each player is asked to report a single location. 

In the Iterative Weighted Median Mechanism (IWMM): 

• An agent reports her location. 

• A mediator reports the weighted median of the reports he gets. More formally, let £ij be the 
report of player djj and let Zij be an arbitrary point selected independently of any reports. 
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Then, lij is the weighted median of the multi-set {(t)_ij/, | dj-ij' £ Cjj} closest 

to Zij. 

• The center locates the facility at the point it would have reported according to the above 
rule, if it were a mediator. 

Notice that IWMM is well defined by Observation 13.11 The next proposition summarizes the 
competitiveness of IWMM when viewed as an algorithm (with respect to its input). 

Proposition 6.2. Assume that in Iterative Weighted Median Mechanism every agent is truthful 
and every mediator follows the protocol, then the mechanism has a competitive ratio of 2 s — 1 for 
any tree metric. 

Proof. Let Fo pt and OPT denote an optimal location for the facility and the corresponding social 
cost, respectively. We prove by induction that for every level 0 < * < s, X^j=i Yha&Ai, dist (£ij,t a ) < 
(2* — 1) • OPT , assuming all the players follow the mechanism. Notice that plugging i = s implies 
the lemma since the center locates the facility at i s _\. 

Let us start with the base of the induction. Every agent doj of level 0 reports its location. 
Thus, Iqj = td 0j and Aqj = {dij}. Using these observations we get: 
m 0 mo 

22 22 dist (4j,4) = 5Z d ist(4j,ido,,) = 0 = (2° - 1) ■ OPT . 

3= 1 aeA 0 j j=1 

This completes the proof of the induction base. Assume the claim holds for i! > 0, and let us 
prove it for i = i' + 1. Given an agent a of level il or lower, let j(i 1 . a) be the (single) index of a 
player of level i' for which a £ Ay If <k,j is a mediator, then, by Observation 12.21 

22 d ist(4j,4-ij(i-i, 0) )< 22 dist(FoPT, 4_lJ(i_l,a)) • 

Qi£Ai f j CL^zA'i^j 

Let M % be the set of mediators on level i, then summing the above inequality over all j £ Mi gives: 

E E d\st(£ij , ^i_l J(i_l,a)) — E E d\St(FoPT,£i-l,j(i-l,a)) ■ 

j£Mi a^zAij aGAij 

Using the triangle inequality, we get: 

E E [dist(4j-,*a) - dist(t a , 

j€Mi a(E:Aij 

s E E d E E dist(FoPT, (■i-l j(i— l,a)) 

jeMi aeAij j&Mi a£A it j 

- E E [dist(EoPT,ta) + dist(f a ,^_ 1 j(*_ 1 ,a))] ■ 

jeMi aeA itj 
Rearranging, gives: 

E E dist (£i,j,t a ) < E E [dist(F 0 pr,ia) + 2 • dist(i a ,4_ij (i _i )a) )] 

a^Ai^j j(E.Mi a(^Aij 

<OPT+ 2• E E dist (t a , £i-i,j(i-i ia )) 

j&Mi aeAij 

mi-i 

= OPT + 2• E E dist(t a ,4-i,.j) 

3 =1 aeAi- 1J 

< OPT + 2 • (2* _1 - 1) • OPT = (2* - 1) • OPT , 
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where the first equality holds since both double sums sum over all the agents appearing below level 
i, and the third inequality is due to the induction hypothesis. The induction step follows since for 
every j 0 Mj, dij is an agent, and thus, dist(£jj, .) = 0 , which implies: 

mi 

E E d ' st (kj,ta) = E E dist (4^«) < ( 2 * -1) • opt . □ 

j = 1 j£Mi a^Aij 

By Theorem 16.11 the direct revelation implementation of IWMM is not ex-post IC. Yet, this 
mechanism satisfies the following, much weaker, incentive property. Informally, every mediator, 
assuming that all his ascendants follow the protocol, and that the input he received from each 
of his direct descendants represents the true location of all agents of that direct descendant, will 
optimize his perceived utility by following the protocol. 

Definition 6.2. A single-location mechanism is a non-direct revelation mechanism in which each 
player is reporting a single location to its direct ascendant. 

A player of a single-location mechanism is straightforward if she is an agent and is truthful, 
or if he is a mediator and the location he reports is optimal with respect to his utility function 
assuming every agent represented by a direct descendant is located at the location reported by that 
direct descendent. 

For a player in a single-location mechanism, we say that being straightforward is naively op¬ 
timal, if being straightforward maximizes the above utility function under the assumption that his 
ascendants are straightforward 

A single-location mechanism is naively incentive compatible if for every player being straight¬ 
forward is naively optimal. 

We note that the term naive comes to emphasize that the players do not form Bayesian beliefs 
regarding the true locations of their agents and they do not try to optimize with respect to that 
belief, but rather (naively) assume that the reports they receive represent the true locations of the 
agents. Such a naive behavior is consistent with a mediator that never allows himself to harm his 
agents in the case when the reported locations are actually the true locations of his agents. 

In Section 16.11 we prove that IWMM is naively incentive compatible. Combining this with 
Proposition 16.21 we derive the following theorem which summarizes the properties of IWMM. 

Theorem 6.3. The Iterative Weighted Median Mechanism (IWMM) is a deterministic naively 
incentive compatible mechanism with a competitive ratio of 2 s — 1 for any tree metric. 

Note that although our incentive property is extremely weak, the competitive ratio of IMWW 
degrades exponentially in s. Can such bad performance be avoided? We conclude this section with 
a lower bound showing that up to constant factors, it cannot. 

Theorem 6.4. Fix any constant e > 0 and tree depth s > 3. Then, no mechanism (possibly 
randomized) that is naively incentive compatible has a competitive ratio of 2 S ~ 2 — 1 — e even for 
line metrics. 

The proof of Theorem 16.41 is based on the following construction. 

Example 2. Consider the metric on the interval [0,1]. Fix r to be an arbitrary positive integer, 
and fix a depth s > 3. We next describe a pair of instances. In both instances, the center has a 
single child mediator. Every mediator of level i > 1 represents 2 mediators of level i — 1, and every 
mediator of level 1 represents r agents, except for d \ ; i which represents r + 1 agents. Notice that 
there are in total r ■ 2 S ~ 2 + 1 agents in each instance of the pair. 
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The first instance, denoted by I\, has all the agents located at point 0. The second instance, 
denoted by I 2 , has all the agents located at 1, except for the agents of di t i which are located at 0. 

Lemma 6.5. Any (possibly randomized) mechanism that is naively incentive compatible and has a 
finite competitive ratio must locate the facility at 0 with probability 1 given either I\ or I 2 . 

Proof. The claim is trivial for I\ since all the agents of I\ are located at 0. Notice that the center 
of I\ has a single child mediator, and therefore, gets only a single report (which is equal to 0). Let 
us prove that the center gets the same report also under I 2 , and therefore, the center must behave 
in the same way for both instances. 

Formally, we need to prove that d s -i.i reports 0. We do so, by proving inductively that dpi 
reports 0 for every 1 < i < s — 1. Notice that all the agents of di i are located at 0, and therefore, 
dpi must report 0 to be straightforward. This completes the base of the induction. Assume 
reports 0, and let us prove that so does dpi for 2 < i < s — 1. Notice that dpi gets only two 
reports. One report from dj_p 1 of value 0 which represents r ■ 2 l ~ 2 + 1 agents, and another report 
which represents only r • 2*~ 2 agents. Clearly, the optimal location for dpi’s utility function (with 
respect the reports it gets) is 0, because this is the reported location of the majority of the agents. 
Therefore, dpi must report 0 to be straightforward. □ 

We are now ready to prove Theorem 16.41 

Proof of Theorem \6-4\ Fix a given naively IC mechanism with a bounded competitive ratio. The 
mechanism must locate the facility at 0 with probability 1 given I 2 ■ However, there are only r + 1 
agents located at 0 while there are r ■ (2 s-2 — 1) agents located at 1. Thus, the mechanism has a 
cost of: r • (2 s-2 — 1), while the optimal facility location has a cost of no more than r + 1. The 
competitive ratio of the mechanism is, therefore, no better than: 

r ■ (2 S ~ 2 - 1) 
r + 1 

which tends to 2 s-2 — 1 as r goes to infinity. □ 

6.1 Proof: IWMM is Naively IC 

In this section we prove that IWMM is Naively IC. This proof tends to follow the same ideas 
used in the analysis of WMM, but there are some subtle differences due to the hierarchical natural 
IWMM and the different solution concepts of the two algorithms (more specifically, WMM is a 
direct revelation mechanism, while IWMM is a single location mechanism). Let us start with a 
generalization of Observation 13.41 for hierarchical settings. 

Lemma 6.6. Fix two mediators dij and dp y for which d^ji is a descendent of d t j in the tree. 
Assume is changed to be £i'j> by some change in the strategies of dy ^ and its descendents. Let 
£ij be the new location of tp^ after the change. If every direct ascendant of dy ^ is straightforward 
and f'pj 7^ i{ j, then and f'pp are both on the path between Iv j and lyy, and ly y is closer to 
&ij than to fpj. 

Proof. We prove the lemma by induction on i — i'. For i — i' = 0 the lemma holds trivially, so let 
us now assume the claim holds for * — i! < h, and let us prove it for i — if = h. We also assume 
every ascendor of dpj/ is straightforward and fpp 7^ £ t .j (otherwise, there is nothing to prove). 

Let dj _i jn be the (single) direct descendent of dpj which is an ascendant of dp y. Let \ jn be 
the new value of after the change in li'y. If = £i-ij", then we get lij = £ t .j which 
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contradicts our previous assumptions. Thus, we must have 4-ij" 7^ By the induction 

hypothesis, this implies that 4-1 j" and 4-i,j" are both on the path between 4'j' and 4',j'> and 
4- i jh is the one closer to 4'j' among the two. 

By Observation 13.41 4j and 4,j are both on the path between 4-ij" and 4-i,j") and 4,j is 
closer to 4—i,j" than 4j- Combining this with the previous observation, we get that the path from 
Ij'j' to 4',j' goes through the points 4-ij" 5 4j 5 4,j and 4-in that order, which completes the 
proof of the induction step. □ 

Using the above lemma, we can now prove that IWMM is naively IC. 

Lemma 6.7. The Iterative Weighted Median Mechanism is naively incentive compatible. 

Proof. Consider an arbitrary player dij. We should prove that being straightforward is naively 
incentive compatible for dj j. In the rest of this proof let 4j denote the report of dij assuming he 
follows the mechanism. Let 4j ^ 4,j be an arbitrary possible deviation of d ir Our objective is to 
prove that assuming all of dij ascendants are straightforward, then reporting 4j is no better for 
dij than being straightforward, with respect to the utility function induced by the reports of <4j’s 
direct descendents. Let 4,1 and 4,1 be the locations picked by the center given that dij reports 
4y and £ t .j. respectively. If 4,1 = 4,1 then the deviation is clearly non-beneficial. Otherwise, by 
Lemma [6761 the path from 4y to 4j goes through 4,1 and 4,1 in that order. 

For every x € [0,1], let p x be the point along the path between 4j and 4y obeying dist (£ij,p x ) = 
x • dist (£ij,£ij), and let c x be the value of the djj’s objective function (with respect to the reports 
he gets from his direct descendents) if the facility is located at p x . Let p be an arbitrary point, 
and let us observe the change in dist (p x ,p) as x increases. Clearly, dist^r^p) initially decreases at 
a constant rate and then starts to increase again in a constant rate (possibly the decrease part or 
increase part is of zero length). Hence, the derivative of c x is a non-decreasing function of x (except 
for a finite number of x values for which it might not be defined). We know that for x = 0, p x is an 
optimal point for djj. Hence, c x is minimized for x = 0. Thus, the derivative of c x is non-negative 
for every x > 0, whenever it is defined. Let y and y' be the values of x for which 4,1 = p y and 
4,i = P y '- Then, we have y < y ', and therefore, c y < c y /, i.e., locating the facility at 4,1 does not 
decrease the objective function of d t j in comparison to locating it at 4,l- Hence, deviating to £ l .j 
is non-beneficial from djj’s perspective. □ 

7 Conclusion 

We studied the impact of strategic mediators on the competitive ratio of IC mechanisms in a facility 
location setting. Our results show that a single layer of mediation cause a moderate degradation in 
the competitive ratio, which becomes much worse as additional layers of mediation are introduced. 
We also showed that randomized mechanisms perform better than deterministic ones. 

Strategic mediators appear in many real world scenarios, and we believe it is important to 
study the implications of their behavior in various settings. For example, in display advertising, 
one common practice is for a mediator to buy advertisement space on a web page and split it between 
multiple advertisers he represents. Assume the mediator gets bids from potential advertisers, and 
based on these bids decides how to bid for the space (in the ad exchange auction). If he wins 
the space he also need to decide how to split the newly bought space among his advertisers. The 
mediator needs a strategy that will incentivize his advertisers to be truthful, which is not a trivial 
task even if the ad exchange uses a second price auction. Studying the effect on the social welfare 
and revenue of such mediators is an interesting open problem. 
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APPENDIX 

A Missing Proofs of Section [2| 

A.l Proof of Observation [2721 

In this subsection we prove the following observation. 
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Observation 12.21 For every non empty finite multi-set S of weighted points, there is at least one 
weighted median. Moreover, a point p is a weighted median of S if and only if locating the facility 
at p minimizes the weighted total cost of a set of aqents located at the points of S (i.e., the sum 

E$ e s w (v)' ffa S ) • dist (p> q))- 

The observation follows immediately from two lemmata. The first lemma shows that any multi¬ 
set S of weighted points always have a weighted median. This lemma is a simple extension of a 
well-known result of Jordan [5j. 

Lemma A.l. For every non empty finite multi-set S of weighted points there is at least one weighted 
median. 

Proof. Given a point p, let V p denote the partition of S into maximal sub multi-sets such that 
every path between two weighted points of different sub multi-sets of V p goes through p. Observe 
that a point p is a weighted median of S if and only if all the sub multi-sets of V p are of weight at 
most w(S)/2. Let R = {p \ 3 w (p,w) E 5} U V. Pick any point r in R. If r is a weighted median, 
we are done. If not, there is exactly one sub multi-set P E V r with a weight larger than w(S)/ 2. 
By the definition of R there must be a point r' E R (r f r) such that the path between every 
weighted point of P and r goes through r'. We claim that if r' is not a weighted median, then there 
must be a sub multi-set P' E P r i that is a subset of P \ S r and has weight larger than w(S)/2. If 
this is so, we can replace r by r' and P by P' and continue in the same way till we find a weighted 
median (the sum YlqeP di s t(<Z, r) strictly decreases with each iteration, and can take only a finite 
set of values). 

Next we show that if r' is not a weighted median, then there must be a sub multi-set P' E Vp 
that is a subset of P and has weight larger than w(S)/2. Since r' is not a weighted median, there 
exists a sub multi-set P" E Vp with weight over w(S)/2. If P" \ P 0, then by the definition of 
r' it holds that P" D P = 0. Therefore, w(P") < w(S) — w(P) < w(S ) — w{S)/2 = w(S)/2, which 
is a contradiction. Thus, we must have P" CP. □ 

The next lemma proves the second part of Observation 12.21 i.e., that the weighted median 
points are exactly the optimal facility locations for minimizing the weighted total cost of a set of 
agents located at the weighted points of S. 

Lemma A.2. Given a non empty finite multi-set S of weighted points, a point p is a weighted 
median of S if and only if locating the facility at p minimizes the weighted total cost of a set of 
agents located at the weighted points of S (i.e., the sum Eggs w (q) ' /(?,£) • dist(p, q)). 

Proof. Consider two points p and q, where p is a median of S (such a point exists due to the 
previous lemma). Assume the facility is located at q, and let us analyze the effect on the cost paid 
by the agents of S if the facility is moved to p. Let S' C S be the multi-set of weighted points 
of S that are connected to q via a path that does not go through p. By the definition a median, 
w(S') < m p < w(S)/2. 

The path in T from every weighted point of S \ S' to q goes through p. Hence, the move of the 
facility from q to p decreases the cost of agents located at these weighted points by dist (p,q). On 
the other hand, the move can increase the cost of an agent located at any other weighted point by 
at most dist(p, q). Hence, the change in the weighted total cost following the move of the facility is 
at most: 


w(S') ■ dist(p, q) — w(S \ S') ■ dist(p, q) = (2 w(S') — w(S)) ■ dist(p, q) (3) 

< (2 • (w(S)/2) — w(S)) ■ dist(p, q) = 0 . 
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Thus, we proved that locating the facility at a median point p of S induces as low a weighted 
total cost for the agents of S as locating it in every other point. In other words, any median point 
p of S minimizes the weighted total cost incurred by a set of agents located at the weighted points 
of S. To prove the other direction, let us tighten the above analysis. Assume q is not a median of 
S. Then, there exists a set S" C S such that the weighted points of S" can be connected via paths 
that do not go through q, and w(S") > w(S)/2. 

If S" fl S' = 0, then w(S') < w(S ) — w(S") < w(S) — w(S)/ 2 = w(S)/ 2. Plugging this 
into <(3]) shows that the change in the total weighted cost paid by the agents of S following the 
move of the facility from q to p is strictly negative. Otherwise, we note that S" % S' because 
w(S") > w(S)/2 > w(S'). Hence, there exist two weighted points q' £ S" 0 S' and q" € S" \ S'. 
Since q' £ S' and q" S' , we learn that the path connecting q' to q" goes through p. However, 
this path cannot go through q because both q' and q" are in S". This implies that the additional 
cost of q' from the move of the facility from q to p is lower than dist(p, q). However, in the analysis 
of © we upper bounded this additional cost by dist (p,q), hence, 0 is a strict upper bound on the 
change in the weighted total cost paid by the agents of S. 

In conclusion, we proved that the assumption that q is not a median of S implies that the total 
weighted cost of a set of agents located at the weighted points of S strictly decreases if the facility 
is moved from q to p. Hence, any non-median point like q does not minimize the total weighted 
cost of such a set of agents. □ 

A.2 Proof of Observation 12.31 

Observation [2731 There exists a deterministic, agent-side incentive compatible mechanism that is 
optimal (1-competitive). 

Proof. This proof uses some results from Section [3l We would like to stress that this does not 
introduce circular references between our proofs. Consider the following mechanism. Let z be an 
arbitrary point chosen independently of the reports of the agents, and let S be the multi-set of 
the reports of all agents (as reported by the mediators to the center). We consider the mechanism 
which chooses as the facility location the unique median of S closest to z. 

The existence of a unique median point closest to z (i.e., closer to 2 than any other median point) 
is guaranteed by Observation 13.11 The optimality of the above mechanism follows immediately 
from Observation 12.21 Thus, we are only left to show that the mechanism is agent-side incentive 
compatible. 

Consider an arbitrary agent a t ] . Assuming mediator di is truthful, we would like to show that 
ajj has no incentive to report a fake location iO ^ tij. Let F and F' be the facility locations 
picked by the mechanism assuming atj reports tjj and fO, respectively. Similarly, let S and S' be 
the multi-sets of the reports of all agents (as reported by the mediators to the center) assuming 
a.i.j reports tij and t\ ■, respectively. If F = F', then reporting the fake location t' l ? is clearly 
non-beneficial for a t ,j. Otherwise, observe that since di is truthful, S' = S U {tf ? } \ {Uj}. Hence, 
by Observation 13.41 F and F' are both on the path connecting tij and t\ ■, and F is closer to tij 
than F'. Thus, dist (tij,F) < dist (tij,F'), i.e., it is strictly non-beneficial for to report the fake 
location t'- ■. □ 

A.3 Proof of Lemma 12.41 

In this subsection we prove the following lemma. 
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Lemma 12.41 For any distribution over points F there exists a point p(F) such that for any finite 
multi-set S of points: 


E 

p'r^F 


^2d\st(p',q) ■ f(q,S ) 

ges 


>Y^d\st(p(F),q) ■ f(q,S ) . 

geS 


Moreover , for the euclidian metric on [ a,b] (for arbitrary a and b) the expected location according 
to F can serve as such a point p(F). 

The proof of the lemma is done by defining a series of probability distributions F \, F 2 ,..., F^ 
that have two properties: 

• For every 1 < i < £, 


E 

p'~Fi 


E dist (p',q) ■ f(q,s) 

g£S 


> E 

p'~F i+ 1 


X! di st (p',q) ■ f(q,s) 

g£ S 


• When sampling according to F( one always gets the same point (a point distribution). 

Let us begin by defining F\. Map each node to an arbitrary edge hitting it. For every edge 
e £ E, let A(e) denote the set of points that are located on e or a node mapped to e. Observe that 
{A(e)} ee E is a disjoint partition of the metric. The probability distribution F\ is identical to F 
except that a point p chosen according to F\ never belongs to a set A(e) if Pr p '~f[p' £ A(e)\ = 0. 
Formally, let E(F) = {e £ E | Pr p ^ E \p' £ A{e)\ > 0}. Given a set S of points: 

Pr \p' £ S\ = Pr [p' G S 0 U eGE{F) A(e)} . 

p'rsjFl p'r^F 


Lemma A.3. F\ is a probability distribution, and E [dist(p', q)} 

p'r^F 

point q. 


E [dist(j/, (/)] for every 

p'rsjFl 


Proof. Clearly Pr^/^^ [p 1 £ S] > 0 for every set S of points. For any countable collection of disjoint 
sets Si, S 2 , ■ ■., we have: 


Pr 

p'^Fi 



i =1 


Pr 

p f n^>F 


P' G 


OO \ 

l+J Si] O U e&E(F)A{e) 


Pr 

p'r^F 


p 1 G y (Si n u e&E{F)A(e)) 
i= 1 


E Pr [p’ £ Si n u e&E (F)A(e)] 

‘ p'r-^F 


1=1 


E Pr . W € Si 

i=l 


Finally, 


Pr \p' £ ld e <z E A(e)\ 

p'rsjFl 


E , Pr P b 'g A ( e )\ = E FJp' G A ( e )] 

z ' p'r^Fl L ' p'r^F 

eeE y eeE(F) F 

E T v Jp ' G - E T t Jp' g a ( e )] = 1 - E ° = 1 • 

i ' p'~F ii ' 

eGB e^E(F) e£E(F) 
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Observe that for every set S of points, we have: 


Pr \p' G S] = Pr k/ G S D U eeE (F)M^)\ = f v Jp' € S \ ~ Y] Y t Jp' e S n A ( e )} 

p' ~ F± p' '~^'F p' ‘ ** p' r^F 

e^E(F) 

= Pr y G S'] - V 0 = Pr y G S 1 , 

p'~F J ^ p'~F J 

e£F(F) 


and therefore, for any point q, E [dist(j/, g)] = E [dist(p', q)]. □ 

p'r^F p f ~Fl 

Before presenting the other probability distributions (1*2, F%,... Ff), we need the following nota¬ 
tion. Given a probability distribution F 1 , and a path P in the tree T such that PnE(F') b 0, let 
A(P) = U e( zpA{e) and let F'\p be a probability distribution in which the probability mass of the 
path P is concentrated at the average of this path. More formally, let u, v be the end nodes of P, 
then for every x G [0,1], we denote by u(x) the point along P obeying dist (u,u(x)) = x ■ dist(u, v). 
For a point p' G A(P) we denote by x(p') the value for which p' = u(x). Also, let E f’[P\ = 
u (E p '~f'[x(p') I p' G A(P)]), then F'\p is a probability distribution which is identical to F' except 
that it outputs E/r/ [P] whenever F' outputs a point along the path P. 

Lemma A.4. Assume P is a path in the tree and F' is a probability distribution in which a point 
p' ~ F' can belong to A(e) if and only if e G E(F'). Then F'\p is a probability distribution, and 
E [dist(p 7 , < 7 )] > E [dist (p',q)\ for every point q. 

p'~F' p'~F'\p 

Proof. Given a set S of points: 


%y £ s|= /~fP e s \ ■ 4(F)l+ 


■ Pr>' € A(P)\ 

p' r^F' 


where 1^ [p]^s is an indicator taking the value 1 when E f'[P] G S, and the value 0 otherwise. Lets 
prove that F'\p is a probability distribution. It is clear that Pi p t^pi\ p \p' G S] > 0 for every set S 
of points. Consider an arbitrary countable collection of disjoint sets Si, £2, • • •• If E f'[P\ does not 
belong to any of the sets Si, then: 


Pr 


p'~F'\ P 



Pr 

p>~F' 


p G 



\A(P) 


= Pr 

p'~F' 


p'g|+| (Si \ A(P)) 
2=1 



On the other hand, if the point Epv[P] belongs to one of the sets Si,S 2 ,..., then it belongs to 
exactly one of them, and therefore: 


Pr 

P'^jF'\p 


00 

p' g l+J Si 
2=1 


= Pr 

p'~F' 


3 'G ( l+J S*) \ A(P) 


\i=l 


+ Pr [p' G A(P)] 

p'r^F' 


= Pr 

p'~F' 


p'e\*)(Si\A{P)) 


2=1 


+ Pr y G A(P)\ 

p'r^F' 


Y Pr [p f G Si \ A(P)\ + Pr [p' G A(P)] = V Pr \p' G Si] . 

p'~F' 1 J n'r^F' /J n f^ J jpt\ n L J 


i =1 


p'~F' 


i= 1 


p'~Fj P 
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Finally, notice that E F /[P] € U eeFJ 4(e). Therefore, 


Pr \p' G U ee E A(e)\ 

p'~F'\p 


Pr [p' G (U eeE A(e)) \ A(P)] + Pr \p' G A(P)} 
p'~F'\p p'~F' 

Pr fp' G U ee E^(e)l - Pr [p' € A(P)j + Pr \p' G A(P)] 

p'~F' p'~F' 

Pr [p' G U eG F^(e)] = 1 . 

p / ~i 7V |p 


Consider an arbitrary point q, and let q 1 be the point on P closest to q. Then, 
E [dist(p', q) | p' G A(P)] = dist(g, q') + E [dist(p', q') \ p' € -A(P)] 

p'r^F' p'~F' 

= dist (q,q')+ E [\x(p') - x(q')\ \ p 1 G A(P)] 

p'r-^F' 

= dist (q,q) + E [max{i'(p') — x(q'),x(q l ) — x(p')} \ p' G A(P)~\ 

p'r^F' 


> d ist(< 7 , q') + max 



- x{q’) | p € A(P)\ , E^ [x(q’) 


x(p') | p' € A(P )] | 


= dist(< 7 , </) + max < E \x(p') \ p' G ^4(P)1 — x(q'),x(q') 

\p'^F' 

= dist(< 7 , q') + max {x(E F /[P]) — x(q'),x(q') — x(E F /[P])} 
= dist(< 7 , q') + dist(Epv[P], q') = dist(E F /[P], q) . 


o [s(pO | p’ G A(P)] } 


If P(P') C P, then: 

Ep,, [dist(p',g)] = E^ ; [dist(p',g) | p' G A(P)] > dist(E F /[P],g) 

= E [dist(p',g) | p' G A(P)] = E [dist(p / , q)] 

p'r^F'\p p'r^F'\p 


Otherwise, 


E [dist(p', q)] 

P' ' 


= E \d\st(p',q) | p' G A(P)] ■ Pr \p' G A(P)] 
p'~F' L J p'~F' 

+ E [dist(p', < 7 ) | p' 0 t 4(P)1 • Pr [p' ^ -A(P)] 
p'~F' L J p'~F' 

>dist(E F /[P],g)- Pr [p'Gi(P)] 

p'~F< 

+ E [dist(p',g) |p'gA(P)l • Pr [p' ^ ^4(P)] 

p'~F' L J p'~F' 

= E [dist(p / ,g 1 ) | p' G j 4(P)1 • Pr [p' G -A(P)] 

p'^i^'lp p'r^F' 

+ E [dist(p',g) | p' £ A(P)] • PrJp' £ A(P)] 

p'~F'|j> p'~F' 

= E [dist(p / , g)] . 

p'^F'lp 


□ 


Using the machinery developed above, we can now define the following process which constructs 
the distributions P 2 , P 2 ,..., Fi . For a probability distribution P', let Q(F') denote the set of points 
that can result from Pb 


1. Let z G- 1. 

2. If E(Fi) > 1, then: 


32 



3. Let Pi be a path containing at least two edges of E{Ff). 

4. Let Fi- )_i i — FAp,. 

5. Else if \Q(F')\ > 1: 

6 . Let Pi be the path containing only the single edge of E{Fi). 

7. Let F^ |_i •£- Fj\p i . 

8 . Else: 

9. Terminate the process. 

10 . i <— i + 1 . 

11. Return to line 2. 


Observation A. 5. The above process always terminates. 

Proof. Observe that \E(Fi)\ > |E(F,; + i)| unless \E(Fi + i)\ = 1 (in which case the process terminates 
after producing Fj_|_i). □ 


Let l be the index of the last probability distribution produced by the process, and let p(F) be 
the single point that can be outputted by Fp Let q be an arbitrary point. From Lemmata IA.3I and 
IA.41 we get: 


E [dist(p', ( 7 )] = E [dist(j/, g)l > E [dist(p',g)] >...> E [dist(p', g)l 

' r^j JT Jpt r^j F\ 'jV r^j Po /v JT „ 


p'r^F 


p'~f 2 


p'~F e 


= dist {p(F),q) 


Consider now an arbitrary finite multi-set S of points. Then, by the linearity of the expectation: 


E 

p'~F 


^2<i\st(p,q) ■ f(q,S ) 
q£S 


= , E r [ dist 0o ?)] • f(q, S )>Y^ dist (p( i? )> q) ■ /(•?> s ) > 

qGS F qes 


which completes the proof of the first part of the Lemma 12.41 Observe that if the metric is the 
euclidian metric on [a, b], then there are two cases. 

• Q(F) = {g} for some point q. In this case £ = 1, and p(F) = q = E [//]. 

p'r^F 

• I <2(^)1 > 1. In this case, one possible execution of the above process chooses P\ which is equal 
to the entire interval [a, b\. For this execution, we get £ = 2 and p(F) = E \p’\ = E [//]. 

p , rsjF\ p'~F 

Thus, in both cases E \p'] can serve as p(F). 

p'r^F 


B Analysis of TRM 

In this appendix we analyze TRM and prove the following theorem. 

Theorem 14.21 For any tree metric, the Tree Randomized Mechanism (TRM) is a randomized 
two-sided IC mechanism with a competitive ratio of 2. 

The next two lemmata prove that p(u) defines a legal probability distribution. Subsections lB.il 
and IB.2I prove the competitive ratio and the two-sided IC of TRM, respectively. 

Lemma B.l. For every node u £ X, p{u) £ [0,1]. 

Proof. First, let us consider a node u £ A\{?’}. For such a node c(u ) = n/4 and n/2 > treesize(u) > 
n/4 because r is a weighted median of L. Hence, by the definition of X: 

treesize(n) — (treesize(n / ) — n/4) < treesize(n) < n/2 , 

u'^X Dchildren ( u) 
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which implies: 


p(u) 


treesize(u) — c(u) 
n/2 


Eu'gxnchiidren(»)( treesize (^) ~ c ( u ')) n/2 - n/ 4 

n /2 — n /2 


1/2 . 


Also, treesize(n) —^ p , eXncllildren ^(treesize(y) —n/4) > n/4 because, either |Xflchildren (u)\ = 
0 or: 


treesize(n) — (treesize(u') — n/4) > n/4 + treesize(n) — treesize(n / ) 

ii/EXnchildren(n) it'EXnchildrer^n) 

> n/4 + treesize(u) — treesize(i/) = n/4 + size (it) > n/4 . 

ti'Gchildrenfp) 

Thus, 

treesize(u) — c(u) S«'exnchiidren(«)( treesize ( u '0 “ c ( u ')) n/4 - n/4 

v ; n /2 n /2 “ n /2 

We are now left to prove that 0 < p(r) < 1. Clearly, c(r) = n/2 and treesize(?’) = n. By the 
definition of X we get: 

treesize(r) — (treesize(n / ) — n/4) < treesize(r) = n , 

li'GXnchildren(r) 


which implies: 

, , _ treesize(r) - c(r) _ E«'gxnchiidren(r)( treesize (y) ~ c ( u ')) < n - n/2 _ ^ 

P[r) ~ n/2 n/2 ~ n/2 

Also, treesize(r) - E„' e xnchiidren(r)( treesize ( u ') “ ra / 4 ) > n / 2 because: 

• If | A fl children(r)| < 1: 

treesize(r) — (treesize(n / ) — n/4) > treesize(?’) — treesize(i/) 

u' EX nchildren(r) u' EX Dchildren(r) 

> treesize(?’) — \X D children(r)| • n/2 > n/2 . 


If \X D children(r)| > 2: 

treesize(?’) — (treesize(n / ) — n/4) > n/2 + treesize(r) — treesize(i/) 

u 1 gXnchildren(r) u 1 godchildren (r) 

> n /2 + treesize(?’) — treesize(i/) = n /2 + size(r) > n /2 . 

li'Gchildren(r) 


Thus, 

/ x _ treesize(r) - c(r) _ Eu'gonchiidren(r)( treesize ( n/ ) ~ c ( u ')) > n/2 - n/2 
P 1 n/2 n/2 ~ n/2 

Lemma B.2. For every node ueX, E«'gonsubtree( u ) P( u> ) = tIcca ™M~ c(u ) , hence, for u = r, we 

9et E«'g xP( u> ) = L 
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Proof. We prove the lemma by induction on the size of \X n subtree(w)|. If \X D subtree(w)| = 1, 
then X n children(u) = 0, and therefore, 

_ treesize(u) - c(u) J2 u 'exnchiidren(u)( treesize ( u ') ~ c ( u )) _ treesize(w) - c(u) 

P U ~ n/2 n/2 _ nj2 

Assume now that the claim hold for | Ansubtree(u) | < h, and let us prove it for |Xnsubtree(ii)| = h. 
Every point v! € X n children(w) obeys: X n subtree(u') CAD subtree(u) \ {u}, and therefore, 
\X n subtree(d/)| < \X n subtree(u)|, he., the induction hypothesis holds for u'. Thus, 


pK) 

w'GXnsubtree(w) 


: p( u ) + ^2 p ( u '} 

u'^X Dsubtree ( u ) 
u'^u 

treesiz e(u) - c(u ) E u ' G xnchiidren(u)(treesize(u') - c(u')) 


n/2 

+ E 

u' GX Dchildren( u) 

treesize(u) — c(u) 

n/2 


n /2 


treesize(n') — c(u') 
n/2 


□ 


B.l Proof: TRM is 2-Competitive 

In this subsection we prove that TRM is 2-competitive as an algorithm (when players are truthful). 
Let V' = V U {ti.j 11 < i < k Al < j < n^}, where V is the set of nodes of the tree produced by the 
first part of TRM. Intuitively, V' contains V and the real locations of all agents. The definition of 
V 1 calls for the following extensions of treesize, subtree, X and c: 

• For a point q outside of V, let u q be the first node of V encountered when moving from q 
away from r. Then, treesize^) = treesize(n g ) and subtree(g) = subtree(n q ). 

• Let X' = {q G V' treesize (q) > n/4}. 

• For every point q € X' \ X, let c(q) = n/4. 

We also define S as the set of segments connecting pairs of points q and q' of V' that are not 
separated by any other point of V'. Observe that since V C V\ the interiors of the sections of S 
are disjoint. 

Observation B.3. There exists an optimal facility location F in V'. 

Proof. Observe that every median of the multi-set A = {ti j |1 < i < k A 1 < j < ni} is an optimal 
facility location, and let F be an arbitrary median of A. If F E V 7 , then we are done. Otherwise, 
we also have F fL A. Let F’ be a point of V' which is closest to F among all points of V. Consider 
an arbitrary multi-set S' of points from A \ {F'} that are connected by paths that do not pass 
through F'. By the dehnition of F', every path between points of A that passes through F also 
passes through F'. Thus, the points of S' are connected by paths that do not pass through F. 
Hence, |«S ,, | < |A |/2 because F is a median of A. Since S' is an arbitrary multi-set of points from 
that are connected by paths that do not pass through F', F' is a median of A that belongs 
to V'. □ 

Observation IB.31 implies that for every agent a,;.j the path between F and Uj is a union of 
segments from S with disjoint interiors. It can be easily seen that the same is true also for each 
facility location that TRM might select. Let us focus on an arbitrary segment s € S connecting 
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two points q, q' £ V', where q' is the point closer to r in the pair (there must be such a point since 
V C V'). We say that an agent ajj uses s under the optimal solution if the path from t t j to F goes 
through s (by Observation IB. 31 we know that the path must either go through all of s or though 
no part of it). Similarly, the probability that a* j uses s under TRM is the probability that the 
path from a t) j to the facility location chosen by TRM goes through s. Observe that in order to 
prove the approximation ratio of TRM, it is enough to prove that the number of agents using s 
under the optimal solution is at least half of the expected number of agents using s under TRM. 
Let treesize(s) denote the number of agents whose location is in the subtree of q. More formally, 

treesize(s) = | {ajj | the path from tij to r goes through g}| . 

The rest of this subsection is split between two cases: treesize(s) < n/2 and treesize(s) > nj2. 


B.1.1 The case of treesize(s) < n/2 

Let us first consider the case treesize(s) < n/2. 

Lemma B.4. treesize(q') < 2 • treesize(s). 

Proof. For an agent to be counted by treesize(g), it must belong to a mediator di obeying t % £ 
subtree (n^). Since U is the median of the (reported) locations of the agents, £ t £ subtree(n q ) 
implies that least half of the agent in A t are located in the subtree rooted at q (because all the 
agents of Ai located outside of this subtree can be connected via paths that does not go through 
£i ). Thus, whenever the agents of Ai are counted by treesize(g), at least half of the agents in Ai 
are also counted by treesize(s). □ 

We can now upper bound the expected number of agents using s under TRM. 

Observation B.5. Lemma \B.‘2\ holds for every point s G X'. 


Proof. If s G X there is nothing to prove. Otherwise, recall that u s is the first node of V encountered 
when moving from s away from r. Then, 


p( u "> = p ( u "> 

w'Gsubtree(s) tt / Esubtree(w s ) 


treesiz e(u s ) — c(u s ) 
n/2 


treesize(s) — c(s) 
n/2 


where the last equality holds because neither s nor u s can be r. □ 

Lemma B.6. The expected number of agents using s under TRM is at most: max{6 • treesize(s) — 
Sn^ 1 • (treesize(s)) 2 — n/2, treesize(s)}. 

Proof. Clearly q ^ r, and therefore, c(q) = n/4. We have to distinguish between two cases. The 
first case is q G X'. In this case, by Observation IB.51 the probability that TRM will locate the 
facility below s in the tree is (treesize(g) — c(q))/(n/ 2). If the facility is indeed located below s, 
then the number of agents that will use s under TRM is n — treesize(s). However, if the facility is 
located above s, then treesize(s) agents will use s. In conclusion, the expected number of agents 
that use s under TRM is: 

teesizefa) - c(g) , _ + G_ treesizefa) - c(,)\ 

n/2 \ n/2 J 

treesize(g) — c(q) 

~ nj2 


(n — 2 • treesize(s)) + treesize(s) 
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2 ■ treesize(s) — c(q) , n - 7 —. . N - ; — r ^- 

<- - - ■ In — 2 ■ treesize(sj) + treesize(s) 

n /2 v 

(2 • treesize(s) — n/4) • (n — 2 • treesize(s)) + treesize(s) •(n/ 2 ) 

n/2 

3n • treesize(s) — 4 • (treesize(s )) 2 — n 2 /4 - : —■ „ , .- - —. .. 9 

=-—- 7 -—-— = 6 • treesize(s) - 8 n _1 • (treesize(s )) 2 - n /2 . 

n/2 

Let us now consider the second case, i.e., the case that q ^ X'. Notice that in this case 
treesize(g) < n/4, and therefore, no point of X' is located in the subtree rooted at q. This implies 
that the probability that the facility will be located below q is 0 , i.e., exactly treesize(s) agents use 
s under TRM. □ 

On the other hand, we can also characterize the number of agents using s under the optimal 
solution. 

Lemma B.7. The number of agents using s under optimal solution is: treesize(s). 

Proof. If F is above s in the tree, then the number of agents using s under the optimal solution is 
clearly treesize(s). 

Otherwise, treesize(s) > n /2 because I 7 is a median of A. Hence, the number of agents using 
s under the optimal solution is n — treesize(s) < n/2. Recall that we assumed treesize(s) < n/2, 
thus, we must have: treesize(s) = n/ 2 , which implies: n — treesize(s) = treesize(s) = n/ 2 . □ 

We can now conclude the proof for the case treesize(s) < n/2. 

Corollary B.8. The number of agents using s under TRM is at most twice the number of agents 
using s under the optimal solution. 

Proof. If treesize(s) agents or less use s, in expectation, under TRM, then there is nothing to prove. 
Otherwise, by Lemmata IB. 61 and IB.71 the ratio between the number of agents using s under TRM 
and under the optimal solution is at most: 

6 • treesizefs) — 8 n _1 • (treesize(s )) 2 — n /2 , -. , n 

-Ld— \ = 6 - 8 n -1 • treesize(s)- - . 

treesize(s) 2 • treesize(s) 

It can be easily checked that the last expression is maximized for treesize(s) = n/4, and the value 
of this maximum is: 

6 _ s „-' . (, V4 ) _ - r s~2-2 = 2. □ 


B.1.2 The case of treesize(s) > n/2 

Let us now consider the case treesize(s) > n/2. 

Lemma B.9. n — treesize(g) < 2 • (n — treesize(s)). 

Proof. The proof of this lemma is analogous to the proof of Lemma IB. 41 □ 

We can now upper bound the expected number of agents using s under TRM. 

Lemma B.10. If treesize(s) > 5n/8, then the expected number of agents using s under TRM is 
upper bounded by: 10 • treesize(s) — 8n -1 • (treesize(s)) 2 — 5n/2. Otherwise, it is upper bounded by 
max{10 • treesize(s) — Sn^ 1 • (treesize(s)) 2 — 5n/2, treesize(s)}. 
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Proof. Clearly q r, and therefore, c(q) = n/4. We have to distinguish between two cases. The 
first case is that q € X'. In this case, by Observation IB .51 the probability that TRM locates the 
facility below s in the tree is (treesize(g) — c(q))/(n/ 2). If the facility is indeed below s, then 
n — treesize(s) agents use s under TRM. However, if the facility is above s, then treesize(s) agents 
use s under TRM. Hence, the expected number of agents using s under TRM is: 


treesize(q) — c(q) 
n/2 

treesize(g) — c(q) 
n/2 

( c(q ) — n) + (n — treesize(g)) 
n/2 


(n — treesize(s)) + 11 — 


treesize(g) — c(q) 
n/2 

(n — 2 • treesize(s)) + treesize(s) 

(2 • treesize(s) — n) + treesize(s) 


treesize(s) 


< 


( c(q ) — n) + 2 (n — treesize(s)) 

^72 


(2 • treesize(s) — n) + treesize(s) 


(5n/4 — 2 • treesize(s)) • (2 • treesize(s) — n) + treesize(s) • (n/2) 

n/2 


5n • treesize(s) — 4 • (treesize(s)) i — 5n 2 /4 
n/2 


= 10 • treesize(s) — 8 n 1 • (treesize(s))'' : — 5n/2 . 


Let us now consider the second case, i.e., the case that q ^ X'. Notice that in this case 
treesize(g) < n/4, and therefore, no point of of X' is located in the subtree rooted at q. Thus, the 
probability that the facility will be located below s is 0, and exactly treesize(s) agents use s under 
TRM. Moreover, in this case, by Lemma IB.91 

-. . n — treesize(n) n + treesize(o) 5n „ 

treesize(s) < n -— =-— < — . □ 

w - 2 2 8 

On the other hand, we can also characterize the number of agents using s under the optimal 
solution. 


Lemma B.ll. The number of agents that use s under the optimal solution is: n — treesize(s). 

Proof. The proof of this lemma is analogous to the proof of Lemma IB. 71 □ 

We can now conclude the proof for the case treesize(s) > n/2. 

Corollary B.12. The number of agents using s under TRM is at most twice the number of agents 
using s under the optimal solution. 

Proof. If lO-treesize(s)— 8 n^ 1 -(treesize(s)) 2 —5n/2 < treesize(s) < 5n/8, then n—treesize(s) > 3n/8 
agents use s under the optimal solution, while at most treesize(s) < 5n/8 use it, in expectation, 
under TRM. The ratio between these values is 5/3 < 2. 

Otherwise, by Lemmata IB. 101 and IB. Ill the ratio between the number of agents using s under 
TRM and under the optimal solution is at most: 

10 ■ treesize(s) — 8 n -1 • (treesize(s )) 2 — 5n/2 
n — treesize(s) 

10 • [n — (n — treesize(s))] — 8 n -1 • [n — (n — treesize(s ))] 2 — 5n/2 

n — treesize(s) 
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6 • (n — treesize(s)) — 8 n 1 • (n — treesize(s )) 2 — n/2 
n — treesize(s) 

= 6 — 8 n _1 • (n — treesize(s))-. 

2 (n — treesize(s)) 

It can be easily checked that the last expression is maximized for treesize(s) = 3n/4, and the value 
of this maximum is: 


Th 

6 - 8 n _1 • (n - 3n/4)- 7 -— = 6- 2- 2 = 2. □ 

v 11 2 ■ (n - 3n/4) 

Combing the results we proved for the two cases, we get the competitive ratio of TRM. 

Corollary B.13. The Tree Randomized Mechanism (TRM) is 2-competitive. 

Proof. Corollaries IB. 81 and IB.121 prove that the expected number of agents using segment s under 
TRM is always at most double the number of such agents under the optimal solution. Hence, the 
expected contribution of s to the cost of TRM is always at most double its contribution to the cost 
of the optimal solution. The corollary now follows if we recall that s was chosen as an arbitrary 
segment of S, and that the entire cost of the optimal solution and TRM is contributed by segments 
of 5. □ 

B.2 Proof: TRM is Two-Sided IC 

In this subsection we prove that TRM is Two-Sided IC. Intuitively, the proof has three steps. 

• Showing that when a mediator deviates the “probability mass” of the facility location follows 
the median of the locations reported by the mediator. More specifically, if mediator d deviates 
and reports a set of agent locations whose median is £ instead of the true median £, then the 
“probability mass” moves towards £ inside the path connecting £ and £. Outside this path, 
the probabilities of nodes to become the facility location is unaffected. This step consists of 
Lemma IB. 141 and Corollary IB.151 

• Proving (using the first step) that whenever an agent located at t deviates and reports a 
location t ^ t, then the median of the locations reported by its mediator can only go further 
away from t, and therefore, the “probability mass” of the facility location also goes away from 
t. This step is achieved by Lemma IB. 171 

• Proving (using the first step) that whenever a mediator deviates and reports a set of agent 
locations whose median is £ instead of the true median £, then the “probability mass” is pushed 
away from £, and therefore, things become worse for the agents of the deviating mediator. 
This step is achieved by Lemma IB. 181 

Recall that V is the set of nodes of the tree produced by the first part of TRM. Let us begin 
with the first step. 

Lemma B.14. Assume mediator di reports one of two multi-sets Si and Si of agent locations. Let 
u and u be the medians of Si and Si closest to z, respectively. Assume that the path between u and 
u goes through no other point ofV. Let p(v), X,V and p(v), X ,V denote the value ofp(v), the set 
X and the set V given that di reports Si and Si, respectively. Also, let P = {u,u}, then: 

• For a node v G V \ P, p(y) = p(y). 

• p(u) < piS 1 )- 

• p(u) + p(u) = p(u ) + p(u). 
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Proof. For every node dGVU {u}, let treesize(u), subtree(u), c(v) and treesize(u), subtree(u), c(v) 
denote the values of treesize(u),subtree(u) and c(v) given that d, reports Si and Si, respectively. 
If u V, then we define treesize(u), subtree (u) and c(u ) like in Subsection lB.il Similarly, if u V, 
then we define treesize(u), subtree (u) and c(u) like in Subsection IB.II In the rest of the proof we 
distinguish between two cases. 

First case: In the first case reporting either one of the multi-sets Si and Si induces the same 
root r. Let us begin by proving that p(v) = p(v) for every point v € V \ P. Let a be the lowest 
common ancestor of u and u. Clearly, a € P since no other point of V appears on the path between 
u and u. The subtree of every point v € V \ P which is not an ancestor of a is the same (including 
the size of the nodes) regardless of which multi-set d* reports (Si or Si), and therefore, p(v) = p(v) 
because p(v) depends only on the subtree of v and on the question whether v is the root. Consider 
now a point v £ V \ P which is an ancestor of a, and let v' be the child of v that is an ancestor 
of a when d* reports Si (possibly v' = a). Similarly, let v' be the child of v that is an ancestor of 
a when d* reports Si (possibly v' = a and/or v' = v'). Observe that the declaration of d* affects 
only the subtree of a, and therefore, does not affect any child of v except for v' and v'. Moreover, 
either v' = v' or they are both in P, and thus, treesize(r/) = treesize(fZ) and v' G X v' € X. 
Since these are the only two properties of v' and v' used by the formula of p(v), we get, once again, 
p(v) = p(v). 

By Lemma lB.21 the sum of the probabilities of all points of V is 1. Thus, the change in p(u) 
following the deviation plus the change in p(u) must add up to 0. Formally, 

[p(u) — p(u )] + \p(u) - p(u)\ = 0 => p(u) + p(u) = p(u) + p(u) . 

By the definition of X, if a ^ X, then subtree(a) n X = 0 for the two declarations of d* we 
consider. Since both nodes of P are in the subtree of a, we get in this case that every point v G P 

has p(v) = p(v) = 0. Hence, the interesting case is when a £ X, which we assume next. 

Assume u = a. Notice that this implies u € X. We have three subcases to consider based on 
the membership of u in X and X. 

• If u fL X, then u X because the median of Si is in u, and the median of Si is outside of 
it’s subtree. Hence, p(u) = 0 = p(u), which implies p(u) = p(u), since we already proved 
p(u) + p(u) = p(u) + p(u). 

• If u € X and u 0 X, then p(u) = 0 < p(u), which implies p(u) < p(u) (again, since we already 
proved p(u) + p(u) = p(u) + p(u )). 

• If u G X and u G X, then recall that u is not in the subtree of u and c(u) = c(u) because 

u ^ r. Thus, by applying Lemma lB.21 twice (once for p and once for p): 


p(u) = 


treesize(ii) — c(u) 
ri/2 

treesize('u) — n* — c(u) 
n/2 


E p( i 


L»Gsubtree(w) 

v^u 




uGsubtree(w) 

v^u 


2/Tli 

n 


which implies p(u) < p(u), and the proof continues as before. 
The case u = a is analogous, and therefore, we omit it. 
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Second case: In the second case reporting each one of the multi-sets S t and Si induces a different 
root node r. By Observation 13.41 the root r must move from u to u. Therefore, the deviation does 
not affect the subtree of every point v € V \ P. Since p(v) depends only on the subtree of v and 
on the question whether v is the root, we get that p(v) = p{v) for every point v £ V \ P. 

Using the same argument as in the first case, we get from Lemma IB. 21 that p(u) +p{u ) = p(u) + 
p(u). If u X then p(u) = 0 < p(u), and we are done. Similarly, if u fL X then p(u) = 0 < p(u), 
which implies p(u) < p(u ) since we already proved p(u) +p(u) = p(u ) + p(u). Thus, the interesting 
case is when both u £ X and u G X, which imply also u € X and «£l. 

Notice that subtree(u) = V \ subtree (u) and V = subtree(u), thus subtree(fi) = subtree(u) \ 
subtree('u). Moreover, treesize('u) = n — treesize(u) — m. Combining both observations: 


p{u) = 


treesize(u) — c(u) 


n/2 

(*) treesize(u) — c(u) 


n/2 

[n — treesize(u) — n^] — c(u) 
n/2 

n — Hi — c(u) — c(u) 


p^ 

^Gsubtr ee(u) 
v^u 

p ^ + Y p ^ 

treesize(n) — c(u) 


L’Gsubtree('u) uGsubtree(w) 

v^u 


Y p^ + 


^Gsubtree('u) 

v^u 


n/2 


n/2 

(**) treesize(n) — c{u) — ni 
n/2 


Y p^ 


i’Gsubtree(n) 

v^u 


Y p^ = p( 


= mu - 


2 m 


n 


vGsubtree^) 

v^u 


where (*) holds because subtree(u) = subtree(n) \ subtree(n) and (**) holds treesize(n) = n, 
c(u) = c(u) = n/4 and c(u) = n/2. Hence, p(u) < p(u). □ 

The next corollary generalizes the last lemma to the case where the two multi-sets 5/ and Si 
that di can declare have medians that are further apart. 

Corollary B.15. Assume mediator di reports one of two multi-sets Si and Si of agent locations. 
Let u and u be the medians of Si and Si, respectively. Let p(v) and p{y) denote the value of p(y) 
given that di reports Si and Si, respectively. We also define P(u , u) to be the path between u and 
Ui, and let u = w±,W 2 , ■ ■ ■ ,w s = u denote the nodes along P(u,u). Then, 

• For a node v G V \ P(u , u), p(y) = p(y). 

• For 1 < k < s, Ej=fcPK) < Ej=kP( w j)- 

Proof. Let us consider s multi-sets S = S\, S 2 , ■ ■ ■, S s = S that di can declare, where the median of 
Sh is Wh, and let Ph(v) denote the value of p(y) assuming di declares the multi-set Sh- Lemma IB.141 
gives the following properties of the relation between pj, and Ph+i for every 1 < h < s. 

• A point v £ V\P(u , u) is not the median of either Sh or Sh+i, and therefore, Ph(v) = Ph+iiv). 

• For k > h + 1: Ph('wf) = Ph+\{wj) for every j > k, and thus, Yfj=kPh(wj) = Y^j=kPh+i( w j)- 

• For k = h + 1: p h {w k ) < Ph+i(wk) and p h (yjj) = p h+1 (wj) for every j > k. Thus, 
Tfj=kPh{wj) < T, B j=k Ph+i(w j ). 
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• For k < h: Ph(wj) = ph+i(vjj) for every j such that k < j < h or h + 1 < j. Also, 
Ph(Wi) +p h {w h+ 1 ) = p h+ l{w h ) + p h+ l(w h+1 ). Thus, Yfj=kPh{Wj) = Yfj=kPh+l{Wj). 

Combining the above properties, we can conclude: 

• For a point v £ V \ P(u, u), p(v) = pi(v) = P 2 (v) = ... = p s (v) = p(v). 

• For every 1 < k < s, J2j=kP( w i) = Ej= fe PiK) < T,j=kP^ w j) < ■■■ < T,j=kPs( w i) = 

Ej=kP( w i)- D 

Next, we prove a simple technical observation that is used by the proofs of the following lemmata. 

Observation B.16. Given a series vq,v\, ..., v rn of points in the tree such that the path from vq 
to v m goes through v\, V 2 , ■ ■ ■, v m -\ in this order, and values {Af,Bf | 1 < / < m} such that: 
Yl™=h Af < Y^f=h Bf f or every 1 < h < m. Then: 

m m 

^2 d 'st (,v 0 ,v h ) ■ A h < ^2 dist(u 0 , v h ) ■ B h . 

h =1 h =1 

Proof. Observe that dist(wo, Vh) = E/=i dist(v/— i , Vf). Therefore, 


22 dist ( u o, Vh) ' A h =2Z\ d ' st i v h-i,v h ) ■ 22 A i 


h= 1 


h =1 


f=h 


< 22 ( dist(w/i_i,Uh) • 22 Bf ] =5^ d \st(v 0 ,v h ) ■ B h . 

f=h 


□ 


h =1 


h= 1 


In the rest of this appendix, we use Corollary IB.151 and Observation IB.161 to prove that TRM 
is two-sided IC. 


Lemma B.17. The Tree Randomized Mechanism (TRM) is agent-side IC. 

Proof. Consider an agent a,j, and assume di is truthful. We would like to show that it not beneficial 
for Ojj to deviate and report a fake location tij instead of its real location fjj. If the deviation does 
not affect the median of the agents’ locations reported by di, then the deviation does not affect the 
outcome of TRM, and is clearly not helpful for . Hence, the interesting case is when the median 
of the locations reported by di changes following the deviation from u to u. By Observation 13.41 u 
and u are both located along the path connecting tij and tij, and u is closer to Uj than u. 

Let P(u, u) denote the path connecting u and u, and let us denote the points of V U {11} along 
the path P(u, u) by u = w\, W 2 , ■ ■ ■, w s = u. Also, let p(v), V and p(y), V denote the probability 
p(v) and the set V before and after the deviation, respectively. We can now evaluate the cost paid 
by a iyj : 

S 

22pi v )- d ' st (v,tij) = 22 p( w ) ' dist + 22p( u ’h) ■ dist (w h ,tij) 

vGV v£V\P(il,u) h= 1 

s 

< 22 p(v)- d '^(v,t iJ )+22p( w h)- d 'st(w h ,t i , j ) = 22 p(v) ■ dist(w, tij) , 

vGV\P(u,u) h =1 

where the inequality follows from Observation IB.161 (when vq = tij, Vh = w k , A^ = p(wh) and 
Ph = p(uvh) f° r every 1 < h < s) and Corollary IB.15I Notice that the leftmost hand side is the 
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cost of cijj before the deviation, and the rightmost hand side is the cost of a t ] after the deviation. 
Hence, the above inequality proves that aij has no incentive to deviate. The lemma follows since 
dij is an arbitrary agent. □ 

Lemma B.18. The Tree Randomized Mechanism (TRM) is mediator-side IC. 

Proof. Consider an arbitrary mediator di , and let us assume the agents of di are truthful. Let £4 
be a median of the reports of all the agents represented by di, and let li be an arbitrary point. Our 
objective is to show that the expected social cost of the agents of A{ does not improve if di deviates 
and reports a set of agent locations whose median is £4. 

Let p(v) and p(v) denote the probability p(y) before and after the deviation, respectively. Also, 
let li = W\,W 2 , ■ ■ ■ ,w s = li denote the points of VU{li} along the path from £4 to li. To simplify the 
notation, let p A (v) = p(v) —p(v) be the change in p(v) following the deviation. By Corollary IB. 151 
p^{y) = 0 for every point v G V which is not on the path between £* and li. Hence, the change in 
the expected social cost of the agents represented by dj following the deviation is: 

rii s 

EE [dist (Uj,w h ) ■ p A (w h )\ . (4) 

j= 1 h =1 


To prove the lemma, it is enough to show that the last expression is always non-negative. Let us 
partition Ai into two kinds of agents: A[ contains agents for which the path connecting them to £4 
goes through £ 4 , and A'f contains the rest of the agents represented by d t . Notice that all the agents 
of A” are connected to each other via paths that do not go through Hence, by the definition 
of a median, \A"\ < |Aj|/2, which implies |A'| > \A"\. For every agent a ? ;j G A[ and point Wh, we 
have dist(tjj, wjf) = dist(tjj, If) + dist(£j, w\f). Hence, 


E E [dist (Uj,w h ) ■p A (w h )] 

a i,j ^=1 


S 


S 


= E E[ dist (^>^)-p A K)]+ E E[ dist (^^)-p A M 

h = 1 ai’j h = -1 


- E 


dist (t itj ,li) -Ep A K) 
h=1 


+ \K\ ■ E [ dlSt ( £ i’ W h) ■P A (Wh )] 

h=1 


= \ A i\ ' E [ dist (^> w ft) -p A (^h)] , 

h= 1 


(5) 


where the last equality follows from Corollary IB. 151 since: 

s s s 

&w = i- E p=1_ E = E^™*)^ ^2p A ( w i) = ° • 

h= 1 Dey\{i0i,i02,...,^s} i)Gy\{uii,ui2,...,«)s} h=1 h=l 


For every agent G A'f, let Wh(i,j) denote the first point from the list w\,W 2 , ■.. ,w s that appears 
on the path from tij to li. Then, 

S 

E EI d ist (t it j,w h ) -p A (w h )] 

h= 1 
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h{i,j) s 

E El d ist (t it j,w h ) • p A (w h )] + E E [dist (tij,Wh) ■p A {w h )] 

a i,j £A 'i h=h(i,j )+1 


a i,j^ A i h=1 

_ h(i,j) 

= X X [&st(ti,j,w h )-p A (w h )] + x 

&i,j G-A^ h= 1 

S 

+ E E [dist(w /l( j J -) +1 ,'u; ft ) ■p A {w h )\ 

a i,j£ A " h=h(i,j )+2 

_ h(i,j) _ 

> X] X [ dist (*ij>“ft) •p A (^)] + X 


dist(iij, u , h(ij)+i) E p A (w h ) 

h=h(i,j )-\-1 


ZijGA" h—1 


^eA" 


E p A (w h ) 

h=h(i,j)~\-l 


where the inequality follows from Observation IB.161 (when j-)+i, Vh = Wh+l+h(i,j)i 

A h = 0 and B h = p{w h+1+h{:iij ))-p{w h+ x +h ^) for every 1 <h< s-h(i,j)-l) and Corollary EED 
Notice that for every h < h(i,j ): dist (tij,Wh) = dist— dist (£i,Wh). Hence: 

h(i,j) 

E E I d ist (tij,w h ) ■p A {w h )] 

&i 7 G h = 1 


E 

a i,j 


HiJ) 

dist (ti tj ,£i) • X P A ( w h ) 
h=i 


_ h(*b) 

X X [&st(l i ,w h )-p A (w h )] . 

a i,j£A" h =1 


Addition ally, d istfoj, w fe(i|j)+ i) > dist(tjj,£;) - dist(£j,Wfc^+i), and Efc=fc(ij)+iP A (™/>) > 0 
by Corollary IB. 151 Hence: 


E 

i.ieA'f 


dist {ti,ji w h(i,j)+l) E P A {w h ) 

h=h(i,j )-\-1 


a E 


E 


dist(iij,A) • X ^ A (^) 

s 

dist(iij,A) • X 

h=h(i,j)~ f -1 


E 




dlSt(^) ^/i(2j)+l) E p A (w h ) 

h=h(i,j )~(-1 


X x NU)/W] 

a i,j& A " h=h(i,j )+1 


+ E E [dist(u;^(jj)_|_i, ru/j) -p A (^)] 

a i,j& A i h=h(i,j )+2 


> 


E 


dist(iij,A) • X P A ( w h) 

h=h(i,j )-\-1 


X x [ dist (^>^) -p a (w’/i)] , 

i,j& A " h=h(i,j )+1 


where the second inequality follows again from Observation IB.161 (when uo = ' ( Oi(ij)+i > v h = 
A = 0 and B h = p(w h+1+h{iJ) ) - p{w h+1+h[itj) ) for every 1 < h < s - /i(i, j) - 1) and 
Corollary IB. 151 Combing the last results, we get: 


E El- ist (tij,w h ) ■p A (w h )] 

i , e A" h =i 


44 




























a E 


dist (UjJi) • ^p A K) 
h=1 


s 

- E E [dist(^,w fe ) •p A (w h )] 

aijeA" h=1 


= -\ A i\-^2[ d ' st (ti, w h)-p A (wh)\ , 
h =1 


(6) 


where the equality holds since X^=i[p(^/i) — p(rc/i)] = 0 by Corollary IB. 151 Adding up Expres¬ 
sions © and ([6]) we get that dH) is lower bounded by: 


72j S 

EEi di st (ti,j,w h )-p A (w h )\ > (IA- 


i=i ft=i 


|^"|) • X! [dist(^,^h) -p A (w h )] 

h=1 


s-1 


(Wl - K'|) ■ ^ 


h=l 


dist(w h ,w h+1 ) ■ E P A (wh) 

j=h +1 


>0 , 


where the second inequality follows since the internal sum is non-negative due to Corollary IB.15I 
and \A[\ > \A'(\. ' □ 
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